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INTRODUCTION 


Genomic  DNA  stores  all  the  information  for  living  organisms.  The  faithful  duplication  the 
maintanance  of  DNA  are  therefore  critical  for  any  organisms.  Errors  in  replicating  chromosomal 
DNA  can  result  in  genomic  instability,  leading  to  various  human  diseases,  such  as  cancer,  immune 
system  disorder,  muscle  dystrophy,  and  neurodegenerations.  Thus,  maintaining  genomic  integrity  is 
vital  to  the  normal  growth  of  cells  and  to  human  health.  Maintenance  of  genome  integrity  during  S 
phase  depends  in  part  on  the  activity  and  regulation  of  the  replication  enzymes,  including  replicative 
helicase  called  the  MCM  complex  to  preserve  the  genetic  information.  In  contrast,  deliberate  errors 
are  introduced  during  maturation  of  the  immune  system  by  modifying  enzymes  called  deaminases. 
The  cells  response  to  different  forms  of  damage  is  fundamental  to  its  ability  to  repair  itself  when 
challenged  by  environmental  or  chemical  insults.  The  major  scientific  goal  of  this  proposal  is  to 
understand  mechanisms  of  maintaning  genome  integrity  in  cells  by  applying  a  combined  genetic, 
biochemical,  and  structural  methodologies.  These  studies  will  contribute  to  the  understanding  of  the 
development  of  cancer,  including  leukemias  and  breast  cancer. 
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BODY 

Final  accomplishments  of  the  approved  aims  for  grant  period  07/01/2005-06/30/2008. 

Aim  1:  in  vitro  function  of  MCM  helicase: 

MCM  regulator  structure/function:  While  archaeal  MCM  is  functional  as  a  hexamer  and  double 
hexamer,  the  eukaryotic  MCM  is  inactive  as  a  hexemeric  complex.  We  have  investigated  the  in  vitro 
function  of  an  archael  MCM  function  through  structure  guided  mutagenesis  and  biochemical  study,  which 
reveals  that  the  central  channel  plays  a  very  important  role  in  binding  DNA  for  helicase  activity.  Our 
previous  crystal  structure  of  Methanothermobacter  thermautotrophicus  MCM  (mtMCM)  reveals  that  the 
dHex  has  a  central  channel  of  1 18  A  in  length  from  end  to  end.  The  channel  has  a  diameter  ranging  from  47 
A  at  the  widest  point  to  23  A  at  the  narrowest  point,  as  measured  from  side  chain  to  side  chain.  There  are 
many  positively  charged  residues  located  on  the  inner  surface  of  the  channel,  generating  a  strongly  positive 
channel;  in  contrast,  the  outside  surface  is  mostly  negative.  The  charge  distribution  immediately  suggests  a 
DNA  binding  position.  For  this  proposed  work,  we  have  mutated  six  charge  residues  on  the  central  channel 
of  mtMCM  (R116,K117,R122,K178,  and  R223),  all  located  on  the  inner  central  channel  surface  of  the 
mtMCM.  We  showed  here  that  mutation  of  any  of  these  five  residues  to  alanine  resulted  in  lower  helicase 
activity  compared  with  the  wt  protein.  Although  all  mutants  showed  ATPase  activity  comparable  to  that  of 
wt,  all  five  mutants  had  consistently  reduced  ssDNA  binding  and  four  out  of  the  five  mutants  showed 
reduced  dsDNA  binding,  an  observation  consistent  with  the  role  of  these  residues  in  interaction  with  DNA. 
This  work  has  been  published  in  Biochemistry,  2008,  Aug  29.  Epub  ahead  of  print,  (see  Appendix  1  for 
details). 

With  MCM  regulation  for  helicase  activity  and  DNA  replication,  recent  finding  reveals  that  if  a  complex  of 
GINS  (tetrameric  complex)  binds  to  MCM  in  complex  with  CDC45,  the  MCM  becomes  active  in 
unwinding  DNA.  Thus  GINS  complex  is  the  “missing”  link  that  regulates  and  activates  MCM  helicase 
activity,  and  thus  DNA  replication  activity.  The  GINS  complex,  which  contains  the  four  subunits,  Sld5, 

Psfl ,  Psf2  and  Psf3,  is  essential  for  both  the  initiation  and  progression  of  DNA  replication  in  eukaryotes. 
GINS  associates  with  the  MCM2-7  complex  and  Cdc45  to  activate  the  eukaryotic  MCM  helicase.  It  also 
appears  to  interact  with  and  stimulate  the  polymerase  activities  of  DNA  polymerase  8  and  the  DNA 
polymerase  a-primase  complex.  To  further  understand  the  functional  role  of  GINS,  in  collaboration  with 
Dr.  Jerard  Hurwitz’s  lab,  we  determined  the  crystal  structure  of  the  full-length  human  GINS  hetero- 
tetramer.  The  four  subunits  each  have  a  major  domain  composed  of  an  a-helical  bundle-like  structure. 

With  the  exception  of  Psfl ,  other  subunits  have  a  small  domain  containing  a  three-stranded  (3-sheet  core. 
Each  full-length  protein  in  the  crystal  has  unstructured  regions  that  are  all  located  on  the  surface  of  GINS 
which  are  probably  involved  in  its  interaction  with  other  replication  factors.  The  four  subunits  contact  each 
other  mainly  through  a-helices  to  form  a  ring-like  tetramer  with  a  central  pore.  This  pore  is  partially 
plugged  by  a  16-residue  peptide  from  the  Psf3  N-terminus  which  is  unique  to  some  eukaryotic  Psf3  proteins 
and  is  not  required  for  tetramer  formation.  Removal  of  this  N-terminal  16  residues  of  Psf3  from  the  GINS 
tetramer  increased  the  opening  of  the  pore  by  80%,  suggesting  a  mechanism  by  which  accessibility  to  the 
pore  may  be  regulated.  The  structural  data  presented  here  indicate  that  the  GINS  tetramer  is  a  highly  stable 
complex  with  multiple  flexible  surface  regions.  This  work  has  been  published  in  PNAS,  2007, 104:12685- 
90.  (see  Appendix  2). 

A  model  of  how  the  human  GINS  tetrameric  complex  interacts  and  coordinates  the  activities  of  its 
binding  partners  is  proposed  in  Fig.  6.  In  this  model,  we  suggest  a  direct  contact  between  Psfl  and  Pol  8 
because  an  interaction  between  S.  cerevisiae  Psfl  and  Dpb2  (the  second  largest  subunit  of  Pol  8)  was 
detected  in  a  yeast  two  hybrid  screen.  Because  archaeal  Gins23  binds  to  the  N  terminus  of  MCM  and  an 
interaction  of  Psf3  with  MCM  has  been  detected  in  the  yeast  two  hybrid  system  (S.  Azuma  and  H. 
Masukata,  unpublished  data,  cited  in),  our  model  depicts  Psf3  in  contact  with  MCM.  We  assume  that 
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Psf2  contacts  the  Pol  a  primase  complex  based  on  the  report  that  archaeal  Gins23  interacts  with 
primase.  The  model  also  allows  MCM,  Cdc45,  and  GINS  to  contact  each  other  based  on  the  isolation  of 
this  complex  from  Drosophila  eggs.  Finally,  we  also  suggest  that  GINS  must  interact  with  Pol  e  and  the 
Pol  a-primase  complex  to  coordinate  leading  and  lagging  strand  synthesis,  respectively. 

Aim  2:  in  vivo  effects  of  helicase  function 

1).  To  examine  in  vivo  effects  of  helicase  function  and  in  particular  MCM  roles  in  maintaining 
genome  integrity  in  response  to  damage. 

We  performed  a  screen  for  mcm  mutants  that  are  functional  for  DNA  replication,  but  defective  in 
response  to  damage.  This  screen  was  unexpectedly  difficult  to  implement  and  generated  no  new  strains. 
To  examine  in  vivo  effects  of  helicase  function  and  in  particular  MCM  roles  in  maintaining  genome 
integrity  in  response  to  damage.  We  used  existing  and  newly  generated  mutants  in  MCM  proteins  to 
investigate  how  MCMs  contribution  to  genome  stability  during  chemical  damage.  We  have  initiated  a 
screen  for  mcm  mutants  that  are  functional  for  DNA  replication,  but  defective  in  response  to  damage. 
The  screen  design  is  as  follows.  We  mutagenized  in  vitro  plasmids  containing  wild  type  mcm2^ ,  mcm4^ , 
and  mcm?^.  These  were  transformed  into  strains  containing  temperature  sensitive  mutations  in  mcm2, 
mcm4  and  mcm7,  and  selected  for  viability  at  36°C,  at  which  temperature  the  chromosomal  MCM  is 
inactivated.  Cell  viability  depends  upon  the  plasmid-borne  MCM.  This  ensures  that  the  plasmid  copy 
is  functional  for  replication.  Then,  we  screened  the  surviving  transformants  for  sensitivity  to 
hydroxyurea. 

Our  first  screen  was  conducted  under  conditions  that  are  lethal  for  checkpoint  mutants  such  as 
tsradS.  We  did  not  observe  any  hydroxyurea- sensitive  MCM  clones.  However,  following  the  old 
genetics  adage  “absence  of  evidence  is  not  evidence  of  absence”,  we  are  re-calibrating  our  screening 
conditions  to  use  a  concentration  of  HU  where  we  can  see  a  phenotype  in  Acdsl  checkpoint  mutants. 
This  strain  is  more  resistant  to  HU,  and  our  other  experiments  suggest  that  the  Cdsl  kinase  may  be 
involved  directly  in  MCM  function  (see  below),  which  makes  it  particularly  relevant. 

Meanwhile,  we  performed  a  candidate  approach.  Recently  several  new  MCM  mutations  have  been 
identified  in  the  literature,  particularly  one  associated  with  adenocarcinoma.  We  constructed  this  allele 
into  Mcm4  in  fission  yeast  to  investigate  its  effects  in  our  system.  We  determined  that  this  allele  is 
functional  for  DNA  replication,  but  causes  sensitivity  to  DNA  damage  (that  is,  it  fulfills  our 
requirements).  We  also  determined  that  one  of  our  temperature  sensitive  alleles,  the  mcm4ts-dg  allele, 
is  sensitive  to  HU  and  MMS. 

Our  preliminary  studies  suggest  that  the  sensitivity  for  DNA  damage  may  reflect  the  inability  of  the 
checkpoint  kinase  Cdsl  (Chk2)  to  phosphorylate  mutant  Mcm4ts-dg  protein.  This  analysis  was 
published  in  Mol.  Cell.  Biol.  2008,  28:1724-38)  (See  Appendix  3). 


2).  To  determine  domains  of  MCMs  responsible  for  interacting  with  other  proteins. 

We  identified  several  molecular  associations  between  MCMs  and  other  factors  important  for  genome 
stability.  We  identified  an  association  between  Mcm2  and  the  conserved  histone  acetyltransferase  Mstl 
(hTip60)(G6mez  et  al,  2008  Genetics  in  press)  (Appendix  4).  We  also  determined  that  Mcm4  associates 
the  conserved  homologous  recombination  protein  Rhp51  (hRadSl)  and  with  the  checkpoint  kinase  Cdsl 
(hChk2)  (Bailis  et  al,  2008  Mol.  Cell.  Biol.  28:1724-38)  (see  Appendix  3).  The  mutants  identified  in  Fig 
1  suggest  that  at  least  part  of  this  association  is  mediated  by  the  Mcm4  amino  terminus. 
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Because  the  mstl  mutant  causes  HU  sensitivity,  we  examined  this  phenotype  more  closely  and 
determined  that  Mstl  interacts  genetically  with  many  homologous  recombination  proteins.  Using  two- 
hybrid  and  co-immunoprecipitation,  our  data  suggest  that  this  may  be  mediated  by  interactions  with 
Rad22.  We  are  investigating  whether  the  activity  of  the  Mstl  histone  acetyltransferase  is  required  for 
homologous  recombination  (Gomez  et  al,  2008  Genetics  in  press).  We  conclude  that  Mstl  histone 
acetyltransferase  is  required  for  genome  stability  during  HU  arrest,  and  we  are  examining  how  this  may 
reflect  MCM  association. 


Aim  3:  Structure-function  study  of  deaminase  enzymes 

Apolioprotein  B  (APOB)  mRNA-editing  enzyme  catalytic  polypeptide  (APOBEC)  is  a  family  of 
cytidine  deaminases  found  exclusively  in  verterbrates.  APOBEC  nucleic  acid  deaminases  modify  genes 
by  deaminating  cytosines  in  mRNA  coding  sequences  and  in  ssDNA.  Additionally,  these  enzymes  can 
inhibit  the  replication  of  retroviruses,  such  as  the  human  immunodeficiency  virus  (HIV)  and  hepatitis  B 
virus  (HBV),  and  retrotransposons.  The  APOBEC  family  is  composed  of  APOBEC- 1  (Apol),  Apo2, 
AID,  and  APOBEC-3  (3A,  3B,  3C,  3F,  3G,  and3H).  Apol ,  the  first  member  to  be  characterized, 
deaminates  C^^^^U  in  the  APOB  mRNA  thereby  creating  a  premature  stop  codon,  which  results  in  a 
truncated  protein  with  a  different  function.  Of  the  APOBEC3  subgroup,  APOBEC-3B  (A3B), 
APOBEC-3F  (A3F)  and  APOBEC-3G  (A3G)  have  two  cytidine  deaminase  (CDA)  domains  and  inhibit 
HIV-1  replication  in  the  absence  of  the  HIV  viral  infectivity  factor  (Vif).  APOBEC3  proteins  also  shield 
the  human  genome  from  the  deleterious  action  of  endogenous  retrotransposons:  A3A,  A3B,  A3C  and 
A3F  inhibit  LINE  1  and  Alu  retrotransposition. 

Some  APOBEC  has  two  homologous  cytidine  deaminase  domains,  such  as  Apo3G,  Apo3F,  and  Apo3B. 
However,  others  only  have  one  cytidine  deaminase  domain,  such  as  Apo2  and  AID  that  are 
phylogenetically  the  most  ancient  members  of  APOBEC  family.  AID  induces  somatic  hypermutation 
(SHM)  and  class  switch  recombination  (CSR)  in  activated  germinal  center  B  cells.  Point  mutations  in 
AID  are  responsible  for  an  immunodeficiency  disease,  Hyper-IgM-2  (HIGM-2)  syndrome,  which  is 
characterized  by  a  deficiency  in  isotype-switching  and  high  affinity  antibody  formation.  Apo2,  also 
known  as  ARCD-1 ,  is  ubiquitously  expressed  at  low  levels  in  both  human  and  mouse  and  highly 
expressed  in  cardiac  and  skeletal  muscle.  Pro-inflammatory  cytokines,  TNF-a  and  IL-ip,  induce  Apo2 
expression  in  hepatocytes  via  the  NF-kB  response  elements  identified  in  the  Apo2  promoter  region. 
Apo2  and  A3A  are  the  only  two  APOBEC  enzymes  that  have  not  yet  been  reported  to  deaminate 
cytosine  in  ssDNA  or  RNA  in  vitro.  Apo2  is  encapsulated  into  HIV-1  virions  when  co-expressed  with 
Avif  HIV-1  DNA  in  293T  cells.  However,  studies  fail  to  show  that  Apo2  and  nearly  all  of  the  single 
domain  APOBEC  enzymes  inhibit  HIV-1  viral  replication,  except  for  A3C,  which  exhibits  weak  HIV-1 
antiviral  activity. 

A  comprehension  of  the  molecular  mechanisms  of  the  APOBEC  enzymes  has  been  limited  by  the  lack 
of  3-dimensional  structures  of  APOBEC  proteins  that  use  large  nucleic  acid  substrates.  We  have 
achieved  a  major  breakthrough  through  a  collaboration  of  the  Chen  and  Goodman  laboratories  that 
resulted  in  the  first  high-resolution  crystal  structure  (2.5  A)  for  an  APOBEC  protein,  APOBEC-2, 
reported  in  Nature,  Ref.  (Prochnow  et  al..  Nature,  2007, 445,447-451)  (See  Appendix  5).  The  structure 
is  made  up  of  an  unusual  rod-shaped  tetramer  that  differs  significantly  from  the  square  tetramer  of  the 
free  nucleotide  C  deaminase.  Although  an  enzymatic  activity  for  APOBEC-2  has  yet  to  be  identified,  its 
close  homology  with  AID  enabled  us  to  use  APOBEC-2  as  a  surrogate  to  predict  how  mutations  in 
analogous  regions  might  alter  AID  activity.  This  strategy  proved  to  be  remarkably  effective,  where  we 
found  that  mutations  that  were  predicted  to  influence  dimer  and  tetramer  interface  interactions  caused 
significant  reductions  in  AID  activity.  In  an  analysis  of  “APOBEC2  structure  -  AID  function  by  proxy”, 
we  examined  5  HIGM-2  human  disease-associated  AID  mutants  that  fail  to  make  high-affinity 
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antibodies.  We  found  that  one  is  mutated  at  the  tetramerization  interface,  4  are  mutated  on  an  exposed 
monomer  surface  and  1  mutation  is  located  proximal  to  the  catalytic  active  center.  These  data  illustrate 
the  importance  of  the  multimeric  interactions  in  the  biochemical  function  of  AID,  and,  more 
importantly,  in  its  biological  function. 


Aim  4:  Initiation  of  Human  Hypermutation  by  the  APOBEC  Family  of  Nucleic  Acid 
Deaminases 

Almost  all  mutations  are  either  inconsequential  or  deleterious.  And  yet  mutations  play  a  central  role  in 
evolution,  adaptation  and  fitness  from  microorganisms  to  humans.  The  “trick”  is  to  select  and  then 
amplify  cells  carrying  the  very  few  beneficial  mutations.  Non-patogenic  microorganisms,  Escherichia 
coli  and  Salmonella  typhimurium  use  error-prone  DNA  polymerase  to  enhance  cell  fitness  in  highly 
competitive  environments,  e.g.,  limited  food  resources,  whereas  their  closely  related  pathogens  shut  off 
mismatch  repair  pathways  resulting  in  “hypermutability”.  In  humans,  somatic  hympermutation  (SHM)  of 
the  variable  region  of  immunoglobulin  (Ig)  genes  is  responsible  for  generating  high  affinity  antibodies, 
and  class-switch  recombination  (CSR),  occurring  concomitantly  with  SHM,  determines  where  in  the 
human  lymphatic  system  the  high  affinity  antibodies  act  to  neutralize  invading  antigens.  SHM  and  CSR 
occur  solely  in  activated  B  cells.  A  different  type  of  hypermutation  process,  occurring  in  T  cells,  is 
responsible  for  helping  to  inactivate  HIV-1  AIDS. 

We  have  made  substantial  progress  in  dissecting  the  biochemical  basis  for  the  initiation  of 
hypermutation  by  two  members  of  the  APOBEC  family  of  nucleic  acid  cytidine  deaminases,  AID 
(activation-induced  cytidine  deaminase),  and  APOBEC3G  (A3G)  (2).  AID  is  synthesized  only  in  B  cells, 
where  tightly  regulated  synthesis  occurs  for  a  period  of  several  hours.  AID  is  required  for  SHM  and  CSR 
of  human  Ig  genes,  acting  during  transcription  of  V-genes  on  the  non-transcribed  strand.  A3G  is 
encapsulated  along  with  the  genomic  RNA  in  the  HIV-1  virion.  A3G  plays  an  important  role  in  blocking 
infection  by  HIV-1  in  non-permissive  cells.  Both  enzymes  deaminate  cytidine  to  form  uracil  (C  ^  U)  on 
ssDNA  substrates.  Strict  regulation  of  deamination  is  essential,  because  unregulated  deamination,  caused 
by  expression  and  subsequent  action  of  deaminases  in  the  wrong  place  at  the  wrong  time,  are  likely  to 
lead  to  serious  disease,  e.g.,  AID-initiated  B-cell  lymphomas,  by  causing  excessive  levels  of  mutation 
(typically  a  million-fold  above  normal  mutation  frequencies). 

Our  primary  goal  is  to  study  the  biochemical  and  physical  biochemical  mechanisms  for  the  APOBEC 
family  of  “mutators”.  The  close  collaboration  of  Xiaojiang  Chen’s  laboratory  with  our  laboratory, 
fostered  by  the  Army  Grant  support,  has  enabled  us  to  marry  biochemical,  physical  biochemical  analysis 
with  structural  analysis  (published  in  Nature,  2007,  see  Appendix  5).  During  the  grant  period  two  years 
ago,  we  obtained  the  first  high-resolution  crystal  structure  for  an  APOBEC  enzyme,  APOBEC-2  (Apo2), 
which  is  present  in  heart  and  skeletal  tissue.  Based  on  recent  data  obtained,  where  we’ve  successfully 
cleaved  a  GST-tag  from  A3G,  we  are  in  an  excellent  position  to  obtain  crystals  of  full-length  A3G.  This 
result  has  the  potential  to  initiate  breakthrough  toward  determining  structure-function  relationships  for 
A3G-catalyzed  deamination  of  HIV-1  and  for  A3G  interaction  with  the  viral  infectivity  factor  (vif), 
which  is  responsible  the  destruction  of  A3G  in  T  cells.  If  we  can  model  the  A3G-vif  interaction  in  vitro, 
then  that  would  offer  clear  strategies  to  protect  A3G  against  apoptotic  destruction,  which  could  lead  to 
effective  treatment  against  infection  by  the  AIDS  virus. 

Biochemical  behavior  of  AID 

We  have  shown  previously  that  AID  binds  to  randomly  to  ssDNA,  irrespective  of  base  composition, 
and  catalyzes  deamination  of  C  favoring  WRC  (W  =  A/T,  R  =  purine)  hot  spot  motifs  and  disfavoring 
SYC  (S  =  G/C,  Y  =  pyrimidine)  cold  spot  motifs.  The  enzyme  scans  processively  along  “naked”  ssDNA 
and  tracks  processively  on  the  non-transcribed  strand  of  a  moving  transcription  bubble.  The  deaminations 
occur  in  random  clusters  of  between  4  and  10,  usually  containing  at  least  one  WRC  triplet,  interspersed 
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with  sparsely  deaminated  regions,  i.e.,  regions  with  WRC  motifs  that  are  “ignored”.  The  question  is: 
what  are  the  biochemical  mechanisms?  AID  is  believed  to  act  as  a  dimer  in  vivo  and  its  found  to  be 
phosphorylated  at  Ser38  in  B  cells. 

Impact  of  phosphorylation  null-mutants  on  the  deamination  specificity  of  AID  in  relation  to  human 
hyper-IgM-2  disease 

AID,  purified  from  mouse  B-cells  has  been  shown  to  be  phosphorylated  on  Ser38,  Tyrl84,  and  possibly 
Thr27.  Although  phosphorylation  does  not  appear  to  be  B-cell  specific,  AID  phosphorylation  at  Ser38 
was  suggested  to  be  the  modification  targeting  AID  to  transcribed  dsDNA  through  interactions  with 
RPA.  Since  the  phosphorylation  mutant  S38A  has  differing  levels  of  CSR  deficiency  in  ex  vivo 
complementation  experiments,  we  have  carried  out  a  comparative  biochemical  analysis  using  wild  type 
(WT)  AID,  mutant  S38A  and  additional  phosphorylation  defective  mutants,  to  examine  the  roles  of 
phosphorylation  that  could  account  for  a  loss  of  AID  CSR  activity  in  vivo.  Also,  the  mutation  of  Ser43  to 
Pro  (S43P)  was  made  to  characterize  the  phosphorylation-null  HIGM-2  mutation  found  in  a  patient. 

A  key  new  discovery  emerged  from  the  analysis  with  the  phosphorylation-null  mutants  with  respect  to 
human  hyper-IgM-2  syndrome,  which  is  characterized  by  an  accumulation  of  IgM  antibody  forms  owing 
to  an  inability  to  perform  CSR.  An  examination  of  phosphorylation-null  mutant  deamination  specificities 
showed  the  mutants  to  be  much  more  active  in  two  triplet  deamination  motifs,  a  hot  spot  AGC  motif  and 
an  intermediate  GGC  motif,  which  is  published  by  Pham,  et  al.,7.  Biol.  Chem.  283, 17428-17439  (2008) 
(see  Appendix  6).  The  spectral  differences  in  GGC  and  AGC  for  the  HIGM-2  AID  mutant  S43P  have  the 
following  potentially  important  human  biological  consequence.  The  GGC  and  AGC  motifs  occur  with 
exceptionally  high  frequency  in  human  switch  regions.  There  are  196  GGC  motifs  in  the  S-region  of 
IgG4  and  126  in  the  S-region  of  IgE,  which  are  by  far  the  most  prevalent  triplet  motifs  in  these  two 
switch  regions,  comprising  15%  in  IgG  and  19%  in  IgE.  High  levels  of  C^U  deaminations  targeted  to 
GGC  and  AGC  motifs  in  the  S-region  could  result  in  excessive  double-stranded  DNA  breaks  thereby 
causing  aberrant  recombination  resulting  in  the  elimination  of  CSR  in  HIGM-2  patients  carrying  an  S43P 
mutant  AID  allele.  Here  is  a  “prime”  example  of  how  subtle  biochemical  observations  emanating  from  a 
simple  in  vitro  system  can  shed  important  new  light  on  a  complex  biological  disease. 

Biochemical  behavior  of  APOBEC3G 

Like  AID,  APOBEC3G  (A3G)  scans  ssDNA  processively.  We  showed,  however,  that  unlike 
AID,  deamination  can  occur  directionally  with  a  strong  3 ’^5’  preference.  The  observation  of  directional 
deamination  was  entirely  unexpected  because  there  is  no  obvious  energy  source,  such  as  ATP  hydrolysis. 
During  the  previous  grant  period  we  made  significant  progress  in  understanding  the  biochemical  and 
structural  basis  for  A3G  scanning.  We  showed  that  the  deamination  polarity  appears  to  depend  on  the 
molecular  composition  of  A3G,  which  is  published  by  Chelico  et  al.,  J.  Biol.  Chem.  283,  13780-13791 
(see  Appendix  7).  We  found  that  A3G  exists  in  oligomeric  states  whose  composition  is  dictated  primarily 
by  interactions  with  DNA,  with  salt  playing  an  important,  yet  secondary  role.  Directional  deaminations 
correlate  with  presence  of  dimers,  tetramers  and  larger  oligomers  of  A3G  observed  by  atomic  force 
microscopy,  and  random  deaminations  appear  to  correlate  mainly  with  monomers,  which  are  favored  in 
the  absence  of  salt.  Presteady  state  single-turnover  reaction  rate  measurements  revealed  a  salt-dependent 
inhibition  of  C  deamination  toward  the  3’-ssDNA  region,  offering  a  molecular  basis  underlying  A3G 
deamination  polarity.  Therefore,  the  data  suggest  a  biochemical  basis  to  account  for  the  observation  in 
human  T  cells  in  which  HIV-1  infection  is  restricted  by  A3G-catalyzed  processive  C->U  deaminations 
occurring  on  single-stranded  DNA  with  marked  3 ’^5’  deamination  polarity.  We  suggest  that  diverse 
A3G  oligomerization  occurs  in  vivo  and  may  be  responsible  for  causing  HIV-1  pro  viral  DNA  mutational 
bias. 


Structure-function  analysis  of  APOBEC  enzymes 
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An  initial  structural  breakthrough  was  achieved  through  a  collaboration  of  the  Chen  and  Goodman 
laboratories  that  resulted  in  the  first  high-resolution  crystal  structure  (2.5  A)  for  an  APOBEC  protein, 
APOBEC-2,  reported  in  Nature,  2007  (see  Appendix  5).  We  have  recently  the  structural  work  of 
APOBEC3-G  and  structural  modeling  of  APOBEC3.  Based  on  our  structural  model,  we  have  introduced 
mutations  around  the  predicted  ssDNA  substrate  groove,  and  identified  residues  involved  in  substrate 
specificity,  ssDNA  binding,  and  deaminase  activity.  Our  results  provide  an  important  basis  for 
understanding  the  underlying  mechanisms  of  substrate  specificity  for  the  APOBEC  family. 

Significance 

The  humoral  response  is  able  to  recognize  and  attack  an  “unlimited”  number  of  possible  infectious 
agents  by  radically  diversifying  the  inherited  immunoglobulin  (Ig)  gene.  First,  the  Ig  gene  is  constructed 
from  a  variety  of  inherited  gene  segments  by  a  Rag-mediated  VDJ  recombination  event  which  joins 
together  a  variable  (V),  diversity  (D)  and  joining  (J)  segments  to  make  a  functional  Ig  gene  capable  of 
expressing  a  low  affinity  antibody.  In  order  to  completely  clear  an  infection,  B  cells  undergo  affinity 
maturation  to  produce  antibodies  with  stronger  affinity  for  antigen  and  with  appropriate  effector  function. 
Affinity  maturation  involves  two  diversification  events,  somatic  hypermutation  and  class  switch 
recombination  (SHM  and  CSR).  Our  principal  long-term  objective  is  to  focus  on  SHM  and  to 
reconstitute  the  hypermutational  process  in  an  in  vitro  model  system.  Our  initial  objective  is  to  explore 
the  action  of  activation-induced  cytidine  deaminase  (AID),  an  enzyme  required  for  both  SHM  and  CSR. 
We  have  shown  that  AID-catalyzed  deamination  of  cytosine  residues  on  ssDNA  simulates  three  hallmark 
properties  of  SHM  -  mutational  hot  and  cold  spot  sequences,  broad  clonal  heterogeneity  and 
transcriptional  dependence.  Active  transcription  is  also  a  requirement  for  SHM,  and  our  current  data 
suggest  that  AID  is  able  to  target  C  T  mutations  preferentially  on  the  non-transcribed  strand  of  a 
moving  transcription  bubble.  An  immediate  priority  of  ours  is  to  develop  an  in  vitro  human  transcription 
system  to  investigate  how  AID  is  targeted  to  actively  transcribed  V-  genes  but  not  C  genes.  A 
biochemical  understanding  of  AID,  whose  expression  is  confined  to  B  cells,  and  the  other  APOBEC 
homologs,  whose  expression  is  diverse,  should  provide  important  new  information  on  the  potential  risk 
factors,  which  we  have  referred  to  as  “at  risk  nucleic  acid  sequences”,  which  when  acted  upon 
inappropriately  by  APOBEC  nucleic  acid  C  deaminases  can  have  serious  deleterious  consequences 
resulting  from  APOBEC-induced  genomic  instability,  leading  to  cancer  and  perhaps  even  to 
neurodegenerative  disease.  Progress  in  understanding  the  molecular  interactions  governing  SHM,  which 
along  with  CSR  is  prerequisite  to  generating  high  affinity  antibodies,  would  benefit  substantially  by  the 
availability  of  an  in  vitro  model  system  to  study  AID’s  role.  We  intend  to  provide  this  invaluable 
biochemical  component. 

Future  Plans 

Although  we  have  progressed  towards  understanding  the  biochemical  mechanisms  responsible  for 
initiating  mutations  by  AID  during  SHM  and  by  APOBEC3G  during  HIV-1  inactivation,  an  in-depth 
analysis  will  clearly  require  high-resolution  structural  analysis  of  wild  type  and  mutant  APOBEC  family 
enzymes  alone  and  as  cocrystals  with  their  nucleic  acid  substrates.  The  Army  grant  has  proven  invaluable 
in  support  of  our  efforts  to  meld  biochemical  and  structural  analyses,  specifically  to  study  how  mutations 
can  provide  fitness  both  to  microorganisms  and  higher  organisms,  and  more  generally  to  gain  a  deeper 
understanding  of  the  molecular  basis  of  human  disease. 


KEY  RESEARCH  ACCOMPLISHMENTS: 

•  Cloned,  expressed,  and  purified  a  key  MCM  regulater,  GINS  tetramer  complex. 
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•  Crystallized  and  determined  the  crystal  structure  of  GINS  complex  and  gained  a  functional  insights 
in  knowing  how  GINS  interact  with  MCM. 

•  Characterized  the  functional  activity  of  MCM  by  mutagenesis  based  on  the  structural  prediction. 

•  Demonstrated  that  MCMs  interact  with  the  Cdsl  checkpoint  kinase  (human  Chk2,  budding  yeast 
Rad53),  in  which  MCM  may  be  the  target  for  the  kinase. 

•  Studied  the  biochemical  function  of  a  deaminase,  APOBEC3G. 

•  Crystallized  and  determined  the  atomic  structure  of  a  deaminase,  APOBEC2. 

•  Characterized  the  Structure-function  relationship  of  AID  and  APOBEC3G 


REPORTABLE  OUTCOMES  (in  the  form  of  peer  reviewed  publications): 
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Chang  P.  Wang,  G  Bermudez,  V.  Hurwitz.  J,  and  Chen.  X,  Structure  of  GINS  complex  and  the  functional 
insights  into  DNA  replication.  PNAS,  104:12685-90  (2007)  [Appendix  2] 
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Mar;28(5):  1724-38.  [Appendix  3] 

Go'mez,  EB.  Nugent,  RL.  Laria  S.  and  Forsburg  SL  Schizosaccharomyces  pombe  Histone 
Acetyltransferase  Mstl  (KAT5)  Is  an  Essential  Protein  Required  for  Damage  Response  and  Chromosome 
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CONCLUSION: 

Helicases  and  their  regulators  are  at  the  core  controlling  the  faithful  DNA  replication,  and  deaminases  are 
important  in  mutating  the  particular  part  of  genome  for  generating  high  affinity  antibodies  by  radically 
diversifying  the  inherited  immunoglobulin  (Ig)  gene.  Progress  in  understanding  the  molecular  interactions 
governing  the  assembly,  activation/inhibition  of  helicases  and  deaminases  from  are  valuable  for  the 
understanding  of  the  fundamental  basis  of  human  diseases  such  as  leukemia  and  other  types  of  cancer. 
Toward  this  goal,  we  have  made  considerable  progress  in  the  last  two  years,  as  manifested  by  the  high 
quality  publications  in  this  supported  direction.  In  the  remaining  years,  we  will  based  on  the  solid 
foundation  set  in  the  previous  years  to  further  the  structures/functional  studies  of  helicases  and  deaminases 
in  vitro  system  and  in  vivo  using  yeast  genetics. 
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ABSTRACT  Methanothermobacter  thermautotrophicus  minichromosomal  maintenance  protein 
(mtMCM)  is  a  75  kd  protein  that  self-assembles  into  a  double  hexamer  structure.  The  double  hexamer 
formed  by  the  N-terminal  region  of  mtMCM  has  a  highly  charged  (overwhelmingly  net  positive)  inner 
channel.  Here  we  investigate  the  effects  of  point  mutations  of  some  of  these  charged  residues  on  the 
biological  activities  of  mtMCM.  Although  all  of  the  mutants  were  similar  to  the  wild  type  in  protein 
folding  and  complex  assembly,  we  found  that  mutations  impaired  helicase  activity.  The  study  of  the  DNA 
binding  and  ATPase  activities  of  these  mutants  revealed  that  the  impairment  of  the  helicase  activity  was 
highly  correlated  with  a  decrease  in  DNA  binding,  providing  evidence  consistent  with  the  role  of  these 
charged  residues  of  the  inner  channel  in  interactions  with  DNA. 


The  progression  from  Gi  to  S  phase 
during  the  cell  cycle  is  highly  regulated.  A  group 
of  proteins  essential  for  the  successful  transition 
and  the  completion  of  S  phase  are  the 
minichromosomal  maintenance  proteins 
(MCM).  The  MCM  proteins  in  eukaryotes  are  a 
subgroup  of  six  proteins  (MCM2-7)  within  the 
much  larger  AAA+  ATPase  family  (7).  MCM2- 
7  share  a  highly  similar  stretch  of  amino  acids 
referred  to  as  the  MCM  box  {2-4)  located  in  the 
C-terminal  half  of  the  protein.  The  MCM  box 
contains  the  Walker  A  and  Walker  B  motifs 
responsible  for  the  binding  and  hydrolysis  of 
ATP. 

MCM  proteins  play  an  important  role  in 
the  initiation  of  replication,  as  these  proteins 
serve  as  replication  helicases  for  fork 
unwinding.  During  Gi  phase,  origins  with  the 
potential  to  serve  as  sites  for  the  initiation  of 
DNA  replication  are  marked  by  an  origin 
recognition  complex  (ORC).  Subsequent  binding 
by  Cdc6  and  Cdtl  promotes  the  recruitment  of 
MCM  proteins  to  form  the  pre-replication 
complex  (pre-RC).  Failure  to  recruit  MCM 


proteins  results  in  loss  of  origin  firing  and  Gl- 
phase  arrest.  Ensuing  activity  by  both  Dbf4 
dependent  kinase  (DDK)  and  cyclin  dependent 
kinase  (CDK)  promotes  the  binding  of  Cdc45 
and  recruitment  of  a  number  of  replication 
proteins  to  the  pre-RC,  ultimately  leading  to  the 
initiation  of  replication.  The  presence  of  all  six 
MCM  proteins  is  necessary  for  entry  and 
completion  of  S-phase  (reviewed  in  (5-7). 

Methanobacterium 

thermoautotrophicum  A/Thas  a  single  MCM 
(mtMCM)  protein  that  has  the  ability  to  self 
assemble  into  a  double  hexamer  structure  {8-13). 
Purified  mtMCM  protein  contains  ATPase, 

DNA  binding,  and  helicase  activities  in  vitro. 

The  crystal  structure  of  the  N-terminal  portion  of 
double-hexameric  mtMCM  (N-mtMCM) 
revealed  an  overwhelmingly  net  positively 
charged  inner  channel  surface  {10).  Here  we 
investigated  the  functional  role  of  five  residues 
located  on  the  inner  channel  surface  of  the 
double  hexameric  N-mtMCM  structure. 
Mutational  and  biochemical  studies  revealed  that 


these  residues  play  important  roles  in  helicase 
function. 


EXPERIMENTAL  PROCEDURES 

Cloning  and  mutagenesis.  The  gene  for 
Methanohacterium  thermoautotrophicum  AH 
MCM  was  cloned  into  the  pGEX-2T  expression 
vector  using  the  restriction  enzymes  Xho  I  and 
Xba  I  for  expression  as  a  GST-MCM  fusion 
protein.  Purified  wild-type  mtMCM  protein  was 
always  severely  degraded,  resulting  in  a 
heterogeneous  population.  We  identified  the  two 
major  cleavage  sites  for  this  degradation  by  N- 
terminal  sequencing  and  mass  spectrometry  of 
the  degraded  protein  fragments  (data  not 
shown).  These  two  cleavage  sites  occurred  prior 
to  R275  and  R338.  Both  of  these  residues  are 
located  in  the  linker  region  between  the  N- 
terminal  domains  and  C-terminal  helicase 
domain.  To  decrease  the  degradation  and 
provide  a  more  homologous  protein  product,  we 
generated  a  mutant  with  the  two  arginines 
mutated  to  alanines.  The  degradation  of  the 
mutant  was  greatly  reduced  compared  with  the 
unmodified  wild  type  protein.  At  the  same  time, 
this  mutant  protein  retained  helicase,  ATPase, 
and  DNA  binding  activities  at  levels  similar  to 
those  of  to  the  wt  protein  (data  not  shown).  All 
the  channel  surface  mutants  reported  in  this 
study  were  constructed  to  also  contain  these  two 
mutations  to  eliminate  differences  in  degradation 
between  different  constructs.  In  this  study,  the 
construct  containing  the  two  arginine  to  alanine 
mutations  is  referred  to  as  wild  type  (wt).  Five 
charged  residues  on  the  channel  surface,  R1 16, 
K117,  R122,  K178,  R223  (Figure  lA,  IB),  were 
all  mutated  to  alanine  through  standard  site- 
directed  mutagenesis  methods.  The  sequences  of 
all  clones  were  verified  by  sequencing  the  DNA 
region  encoding  the  entire  MCM  protein. 

Protein  purification.  All  proteins  were  grown  in 
E.  coli  cells  at  37°C  until  OD6oo^0.2  and  then  the 
temperature  was  reduced  to  24°C.  Isopropyl-P- 
D-thiogalactoside  (IPTG)  was  added  to  0.2  mM 
when  OD6oo^  0.5.  The  cells  were  shaken  at  24°C 
overnight.  Cell  pellets  were  re-suspended  in 
purification  buffer  (50  mM  Tris,  pH  8.0,  250 
mM  NaCl,  10  mM  DTT)  and  sonicated  on  ice 
with  protease  inhibitors  pepstatin,  leupeptin,  and 


phenylmethylsulfonyl  fluoride  (PMSF).  The 
GST-MCM  fusion  protein  in  the  supernatant 
was  separated  from  other  proteins  using 
glutathione  resin  (Amersham)  at  4°C.  After 
thrombin  cleavage  of  the  GST-fusion  protein, 
the  free  MCM  protein  was  further  purified  using 
Superose-6  gel  filtration  column 
chromatography  (16/60  cm).  Peak  fractions  were 
collected  and  purity  was  analyzed  by  SDS- 
PAGE.  Protein  was  concentrated  using  Amicon 
Ultra  (Millipore)  centrifuge  devices  and 
quantified  using  the  Bio-Rad  protein  assay  and 
Coomassie  blue  staining  of  an  SDS-PAGE  gel. 

Heat  stability  of  mutants.  A  protein  samples  (10 
jig)  were  heated  at  55°C  in  15  jiL  of  helicase 
buffer  for  30  minutes.  The  heat-treated  protein 
was  analyzed  by  Superose-6  column 
chromatography  and  on  a  10%  SDS-PAGE  gel. 

Helicase  assay.  A  60  nucleotide  primer  ^  TTT 

GGG  AGA  GGC  GGT  TTG  CGT  ATT  GGG 
CGC  C^  (Operon)  was  purified  and  radio- 
labeled  as  previously  described  for  DNA 
binding  assays.  The  labeled  primer  was  mixed  in 
a  2:1  ratio  with  M13mpl8  circular  single- 
stranded  DNA  in  300  mM  NaCl  and  20  mM  Tris 
(pH  8.0),  boiled  in  900  jiL  water  for  3  minutes 
and  allowed  to  cool  to  room  temperature  over  5 
hours.  This  helicase  substrate  containing  34 
complementary  bases  and  a  26  dT  non¬ 
complimentary  overhang  was  purified  from 
excess  primer  using  700  jiL  Sephacryl  300HR 
resin  in  a  2-mL  glass  column  (Bio-Rad). 
Fractions  containing  helicase  substrate  were 
pooled  and  stored  at  4°C.  Reactions  of  20  jiL 
containing  20  mM  Tris  (pH  7.8),  10  mM  MgCb, 
1  mM  DTT,  5  mM  ATP,  0.1  mg/mL  BSA, 
helicase  substrate,  and  MCM  protein  were 
assembled  on  ice.  The  reactions  were  then 
incubated  at  50°C  for  30  minutes,  5x  stop 
solution  (10  mM  EDTA,  0.5%  SDS,  0.1% 
xylene  cyanol,  0.1%  bromophenol  blue,  and 
50%  glycerol)  was  added  and  the  sample  was 
placed  on  ice.  Reactions  were  analyzed  on  a 
12%  polyacrylamide  gel  in  IX  TBE  run  on  ice 
for  50  min  at  150  volts.  The  gel  was  dried  and 


then  exposed  to  film  and  a  phosphorimaging 
plate  for  quantification. 

DNA  binding  assay.  All  primers  (Operon)  were 
resuspended  in  10  mM  NaOH  and  purified  by 
FPLC  using  a  Mono-Q  10/100  column 
(Amersham).  Purified  DNA  was  concentrated 
and  quantified  using  OD26o*  For  the  double- 
stranded  DNA  (dsDNA)  binding  assay, 
complementary  primers  were  mixed,  boiled  for 
10  min  in  900  jiL  of  water,  and  allowed  to  cool 
to  room  temperature  overnight.  The  annealed 
dsDNA  was  then  purified  by  FPLC  over  a 
Superdex-75  16/60  (Amersham)  column  in  50 
mM  NaCl.  The  DNA  used  for  the  single-strand 
binding  was  ^  AAA  GCG  CTG  ACC  TAT  CGC 
GTA  TAG  CTC  GAG  GA^’  and  the 
complementary  oligonucleotides  used  for  the 
double  strand  binding  assay  were  ^  GCG  CTG 
ACC  TAT  CGA  CCT  ATA  CGG  TTA  GCC^’ 
and  ^’GGC  TAA  CCG  TAT  AGG  TCG  ATA 
GGT  CAG  CGC^’.  The  primers  were  labeled 
using  T4  polynucleotide  kinase  (NEB)  and  y- 
^^P-ATP  (Amersham,  3000  Ci/mmol).  The 
labeled  primer  was  purified  using  Microspin  G- 
25  columns  (Amersham)  and  stored  at  4°C. 
Reactions  of  20  jiL  were  assembled  at  room 
temperature  with  50  mM  NaCl,  5  mM  MgCb, 

50  mM  Tris  (pH  7.8),  1  mM  DTT,  0.1  mg/mL 
BSA,  mtMCM  protein  (between  0.5 -1.0  jiM), 
and  15  nM  labeled  DNA.  Reactions  were 
incubated  at  25 °C  for  30  min  and  then  5  jiL 
loading  buffer  (50%  glycerol,  0.1%  xylene 
cyanol,  and  0.1%  bromophenol  blue)  was  added. 
Reactions  were  analyzed  by  gel  electrophoresis 
and  by  filter  binding  assay.  For  gel  analysis,  the 
reactions  were  loaded  onto  a  2%  agarose  gel  and 
electrophoresed  for  1  hour  at  80  volts  in  IX  TBE 
buffer.  Gels  were  dried  and  exposed  to  film  and 
phosphorimaging  plates  for  quantification.  Eor 
filter  binding  assay,  10  ul  reaction  mixtures  were 
applied  to  nitrocellulose  filter  (Millipore,  HA 
0.45  mm).  The  filter  was  washed  with  15  ml 
reaction  buffer.  The  radioactivity  retained  on  the 
filter  was  quantified  by  liquid  scintillation 
counting. 

ATPase  assay.  Reactions  (10  jiL)  containing  50 
mM  NaCl,  5  mM  MgCb,  50  mM  Tris  (pH  7.8), 


1  mM  DTT,  0.1  mg/mL  BSA,  y-^^P-ATP 
(Amersham  3000  Ci/mmol),  15  jiM  cold  ATP, 
and  various  amounts  of  MCM  proteins  were 
assembled  on  ice.  Reactions  were  incubated  at 
50°C  for  30  minutes,  plunged  into  ice  water  and 
10  mM  EDTA  was  added.  Aliquots  (5  jiL)  from 
each  reaction  were  placed  onto  a  pre-washed 
PEI  cellulose  TLC  plate  (EMD  Chemicals  INC) 
and  run  for  two  hours  in  2  M  acetic  acid  and  0.5 
M  LiCl.  Plates  were  dried  and  exposed  to  film 
and  phosphorimaging  plates  for  quantification. 

RESULTS 

Location  of  mutated  residues  on  the  mtMCM 
double-hexamer  structure.  The  crystal  structure 
of  the  N-terminal  region  of  mtMCM  (N- 
mtMCM)  shows  that  the  five  residues  (R1 16, 

K1 17,  R122,  K178,  R223)  studied  here  are 
located  close  to  the  central  channel  surface  (Fig. 
lA,  IB).  K117,  R122,  andK178  are  highly 
conserved  among  eukaryotic  MCMs  (Figure 
1C).  R223  is  not  strictly  conserved,  but  there 
appear  to  be  few  positively  charged  residues  in 
the  vicinity  of  R223  among  the  aligned  MCM 
sequences  and  R1 16  appears  to  be  unique  to 
mtMCM  (Figure  1C).  R1 16,  K1 17,  and  R122 
are  located  on  the  same  P-strand  and  K178  is  on 
the  adjacent  anti-parallel  P-strand  (Figure  1  A, 
IB).  R223  is  located  on  the  previously 
characterized  DNA  binding  P-hairpin  finger  {10) 
and  is  close  to  the  cluster  of  the  other  mutated 
amino  acids. 


Mutant  protein  assembly  and  stability.  We  first 
performed  experiments  to  test  the  structural 
integrity  of  the  mutant  mtMCM  proteins  in  order 
to  rule  out  the  possibility  of  disruption  of  protein 
folding  or  assembly  by  the  mutations.  All 
mutant  proteins  and  the  wild-type  (wt)  protein 
were  analyzed  by  FPLC  using  an  analytical 
Superose-6  gel  filtration  column  before  and  after 
the  proteins  were  heat-treated  at  55  °C.  With  or 
without  heat  treatment,  all  mutants  had  similar 
chromatographic  profiles  as  the  wt, 
characterized  by  a  peak  (P2)  with  an  apparent 
molecular  weight  of  a  double  hexamer  of 
mtMCM  (Figure  2A).  SDS-PAGE  of  fractions 
collected  across  this  peak  revealed  a 


predominant  band  consistent  with  the  size  of  the 
full-length  protein  (Figure  2B).  N-terminal 
sequencing  showed  that  other  minor  bands  were 
degradation  products  of  the  full-length  protein 
that  could  not  be  completely  eliminated  even 
after  extensive  purification.  As  all  mutants  and 
wt  showed  similar  extents  of  degradation,  the 
degradation  should  not  interfere  with  the 
comparison  and  interpretation  of  biochemical 
results.  The  chromatography  analysis  indicated 
that  none  of  the  mutations  affected  mtMCMs 
folding,  oligomerization  state,  or  heat  stability  in 
solution. 


Helicase  assay.  A  helicase  assay  showed  that  the 
mutants  had  various  degrees  of  reduced  helicase 
activity  when  compared  with  the  wt  protein 
(Figure  3A,  3B).  Over  the  tested  protein 
concentration  range,  helicase  activity  of  R223A 
was  2-4  fold  lower  than  the  activity  of  the  wt 
protein  and  the  helicase  activities  of  R1 16A, 

K1 17A,  R122A,  and  K178A  were  3-7  fold  lower 
that  that  of  the  wt  (Figure  3A,  3B).  This 
suggests  that  these  charged  residues  are 
important  for  the  helicase  activity  of  mtMCM. 

DNA  binding.  The  ability  of  wt  and  mutants  to 
bind  single-stranded  DNA  (ssDNA)  and  double- 
stranded  DNA  (dsDNA)  was  determined  by 
native  gel  shift  using  a  radio-labeled 
oligonucleotide  and  four  different  concentrations 
for  each  protein.  All  mutants  showed  reduced 
ssDNA  binding  compared  to  wt  (Figure  4A).  At 
the  protein  concentration  of  1  jiM,  the  ssDNA 
binding  activity  for  K1 17A  and  K178A  was 
approximately  4  fold  lower  than  that  for  the  wt 
and  for  the  other  three  mutants  (R1 16A,  R122A, 
R223A),  it  was  roughly  2  fold  lower  than  wt. 
dsDNA  binding  assay  results  were  quite  similar 
to  those  of  the  ssDNA  binding  analysis  (Figure 
4B).  Four  out  of  the  five  mutants  (K1 17A, 
R122A,  K178A,  R223A)  showed  lower  dsDNA 
binding  (approximately  2-4  fold)  than  the  wt 
(Figure  4B).  However,  R1 16A  bound  to  dsDNA 
at  a  level  similar  to  the  wt  protein. 

ATPase  activity.  The  effect  of  each  mutation  on 
the  ability  of  mtMCM  to  hydrolyze  ATP  was 


measured  using  standard  ATPase  assays.  The 
mutations  (R116A,  K117A,  R122A,  K178A, 
R223A)  had  little  effect  on  the  level  of  ATP 
hydrolysis  when  compared  to  wt  (Figure  5 A, 

5B),  suggesting  that  none  of  the  residues  are 
directly  involved  in  ATP  binding  or  hydrolysis. 

DISCUSSION 

The  charged  residues  under 
investigation  (RI 16,  KI 17,  RI22,  KI78,  and 
R223)  are  all  located  on  the  inner  central 
channel  surface  of  the  N-mtMCM,  a  location 
potentially  important  for  interacting  with  DNA, 
which  binds  within  the  central  channel,  and  for 
helicase  function.  We  showed  here  that  mutation 
of  any  of  these  five  residues  to  alanine  resulted 
in  lower  helicase  activity  compared  with  the  wt 
protein.  Although  all  mutants  showed  ATPase 
activity  comparable  to  that  of  wt,  all  five 
mutants  had  consistently  reduced  ssDNA 
binding  and  four  out  of  the  five  mutants  showed 
reduced  dsDNA  binding,  an  observation 
consistent  with  the  role  of  these  residues  in 
interaction  with  DNA. 

Even  though  the  positively  charged 
residues  mutated  in  this  study  are  all  located  on 
the  inner  channel  of  the  double  hexamer,  the  five 
residues  have  different  degrees  of  exposure  to 
the  channel  surface.  For  example,  R122,  Rl  16, 
and  Kl  17  are  located  on  the  side  of  the 
monomer  that  faces  the  next  monomer  within 
the  ring  (Figure  1  A,  IB),  forming  the  edge 
around  the  wide-open  side  of  the  channel.  These 
three  residues  are  not  as  exposed  to  the  channel 
surface  as  R 122  and  K178.  In  order  to  directly 
contact  DNA,  a  modest  conformational  change 
would  be  required  to  expose  these  residues  more 
to  the  inner  channel  surface.  Alternatively,  DNA 
may  interact  with  these  residues  by  binding  or 
passing  the  side  channel,  as  suggested  in  the 
looping  model  of  SV40  LTag  {14-16). 

Furthermore,  the  location  of  the  residues 
near  the  interface  between  two  adjacent  subunits 
(R122,  Rl  16,  Kl  17)  may  influence  the 
interactions  between  two  neighboring  monomers 
during  conformational  changes  necessary  for 
helicase  function.  Of  the  residues  evaluated, 
R122  has  been  shown  to  have  direct  contact  with 


the  next  neighbor  through  the  P -hairpin  (Figure 

IB) .  Indeed,  of  the  mutants  analyzed,  R122A, 

R1 16A,  and  K1 17A  showed  the  most  significant 
reduction  in  ssDNA  and  dsDNA  binding  (Figure 
4)  as  well  as  in  helicase  function  (Figure  3). 

R1 16,  K1 17,  and  R122  are  conserved  among 
most  S.  cerevisiae  and  human  MCMs  (Figure 

IC) ,  suggesting  the  importance  of  these  residues 
for  MCM  function. 

Based  on  the  N-mtMCM  structure, 

R223  sits  on  the  previously  characterized  DNA 
binding  P-hairpin  structure  that  has  been  shown 
to  be  important  for  helicase  function  (10,  17). 

Previously,  double  or  triple  mutations  of  three 
other  positively  charged  residues  on  this  P- 
hairpin,  R226,  R229,  and  K231,  were  shown  to 
completely  disrupt  the  DNA  binding  and 
helicase  function.  Intriguingly,  point  mutation  of 
R223  to  alanine  only  partially  disrupted  MCM 
DNA  binding  and  helicase  activity  (Figure  3, 

Figure  1:  Location  of  the  five  positively  charged  residues  on  the  inner  channel  surface  of  N- 
mtMCM.  A:  N-mtMCM  hexamer  structure  showing  the  five  mutated  residues,  R1 16  (yellow),  K1 17 
(pink),  K178  (green),  R122  (blue),  and  R223  (cyan).  B:  A  close-up  of  the  N-mtMCM  view  shown  in 
panel  A.  R223  is  on  one  side  of  the  P-hairpin,  R122  is  adjacent  to  the  P-hairpin  of  another  monomer. 

K1 17  is  also  on  one  side  of  a  monomer,  around  the  side  channel  formed  between  two  neighboring 
subunits.  C:  Sequence  alignments  of  mtMCM  (methanoMCM)  and  MCM2-7  from  S.  cerevisiae 
(scMCM)  and  humans  (humanMCM).  Amino  acids  mutated  in  this  study  are  marked  by  asterisks.  Red 
boxes  highlight  aligned  residues  that  are  mutated  in  this  study. 

Figure  2:  Gel  filtration  analysis  of  wt  and  mutant  mtMCM  proteins.  A:  FPLC  overlay  of  OD280 
curves  from  wt  (blue)  and  the  five  mutant  proteins  run  on  a  preparative  grade  Superose  6  column.  The 
profiles  of  all  proteins  overlapped.  Peak  1  (PI)  is  in  the  void  volume,  P2  contains  protein  that  has  an 
apparent  molecular  weight  (Mwt)  of  approximately  960  kD,  peak  3  (P3)  contains  degraded  protein  that 
cannot  oligomerize.  The  peak  P2  has  an  apparent  molecular  weight  consistent  with  that  of  a  double 
hexamer.  B:  SDS-PAGE  analysis  of  the  peak  fraction  of  P2  for  the  wt  and  each  mutant  protein.  Lanes  1-6 
are  wt,  R1 16A,  K1 17A,  K178A,  R122A,  and  R223A,  respectively.  An  asterisk  marks  the  position  of  the 
full-length  protein.  The  minor  bands  below  the  full-length  protein  band  were  the  result  from  the  small 
amount  of  degradation  of  the  full-length  protein. 

Figure  3:  Helicase  activity  of  wt  and  mutant  proteins.  A:  Gel  analysis  of  helicase  assay.  Increasing 
concentrations  of  wt  (lane  3-6),  R223A  (lane  7-10),  R1 16A  (lanes  11-14),  K1 17A  (lanes  15-18),  R122A 
(lanes  19-22),  and  K178A  (lanes  23-26)  were  incubated  with  radio-labeled  helicase  substrate  at  50°C  for 
30  minutes  and  samples  were  then  analyzed  by  12%  polyacrylamide  gel  electrophoresis.  Boiled  and  no 
protein  controls  are  in  lanes  1  and  2,  respectively.  B:  Quantitation  of  unwinding  in  panel  A,  expressed  as 
percentage  of  ssDNA  released  from  the  dsDNA  substrate.  Protein  concentrations  were  calculated  based 
on  double  hexamer  molecular  weights.  Error  bars  (s.e.m.)  are  from  a  minimum  of  three  independent 
experiments. 


Figure  4),  suggesting  that  the  multiple  positively 
charged  residues  on  the  P-hairpin  may  be 
partially  complementary  to  each  other. 

In  summary,  this  mutational  study 
clearly  demonstrated  that  the  positively  charged 
residues  on  the  inner  central  channel,  R1 16, 

K1 17,  R122,  K178  and  R223,  are  critical  for  the 
functional  activities  of  mtMCM.  Mutation  of 
any  of  these  residues  resulted  in  decreased 
helicase  activity,  which  was  strongly  associated 
with  reduced  binding  to  both  ssDNA  and 
dsDNA.  Even  though  these  residues  are  located 
outside  of  the  highly  conserved  MCM  box 
(helicase  domain),  the  critical  residues  for 
classifying  MCM  proteins,  four  of  the  five 
residues  are  highly  conserved  in  eukaryotes, 
underscoring  the  functional  significance  of  these 
charged  residues  of  the  hexameric  channel 
surface  of  the  N-terminal  region  of  MCM. 


Figure  4:  DNA  binding  of  wt  and  mutants.  A:  Quantitation  of  ssDNA  binding  of  wt  and  mutants  at  the 
indicated  protein  concentrations.  B:  Quantitation  of  dsDNA  binding  of  wt  and  mutants  at  the  indicated 
protein  concentrations.  Protein  concentrations  were  calculated  based  on  double  hexamer  molecular 
weights.  Error  bars  (s.e.m.)  are  from  a  minimum  of  three  independent  experiments. 

Figure  5:  ATPase  activity  of  the  wt  and  mutants.  A:  TLC  analysis  of  ATPase  assay.  Four  protein 
concentrations,  16,  32,  64,  and  128  nM,  were  used  for  wt  (lanes  2-5  and  19-22),  R1 16A  (lanes  6-9), 

K1 17A  (lanes  10-13),  R122A  (lane  14-17),  K178A  (lanes  23-26),  and  R223A  (lanes  27-28).  No  protein 
control  is  in  lanes  1  and  18.  B:  TLC  analysis  of  ATPase  assay  of  the  mutants  in  the  presence  of  dsDNA 
(10  ng/ul  1.0  kb  PCR  DNA  product),  showing  that  all  mutants  responded  to  dsDNA  stimulation  of 
ATPase  activity  similarly  to  the  wt.  C:  Quantitation  of  results  from  panel  A.  Error  bars  (s.e.m.)  are  from 
a  minimum  of  three  independent  experiments.  Protein  concentrations  were  calculated  based  on  double 
hexamer  molecular  weights. 
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The  GINS  complex,  which  contains  the  four  subunits  SIdS,  Psf1, 
Psf2,  and  Psf3,  is  essential  for  both  the  initiation  and  progression 
of  DNA  replication  in  eukaryotes.  GINS  associates  with  the  MCM2-7 
complex  and  Cdc45  to  activate  the  eukaryotic  minichromosome 
maintenance  helicase.  It  also  appears  to  interact  with  and  stimulate 
the  polymerase  activities  of  DNA  polymerase  e  and  the  DNA 
polymerase  a-primase  complex.  To  further  understand  the  func¬ 
tional  role  of  GINS,  we  determined  the  crystal  structure  of  the 
full-length  human  GINS  heterotetramer.  Each  of  the  four  subunits 
has  a  major  domain  composed  of  an  a-helical  bundle-like  structure. 
With  the  exception  of  Psf1,  each  of  the  other  subunits  has  a  small 
domain  containing  a  three-stranded  jS-sheet  core.  Each  full-length 
protein  in  the  crystal  has  unstructured  regions  that  are  all  located 
on  the  surface  of  GINS  and  are  probably  involved  in  its  interaction 
with  other  replication  factors.  The  four  subunits  contact  each  other 
mainly  through  a-helices  to  form  a  ring-like  tetramer  with  a  central 
pore.  This  pore  is  partially  plugged  by  a  16-residue  peptide  from 
the  Psf3  N  terminus,  which  is  unique  to  some  eukaryotic  Psf3 
proteins  and  is  not  required  for  tetramer  formation.  Removal  of 
these  N-terminal  16  residues  of  Psf3  from  the  GINS  tetramer 
increases  the  opening  of  the  pore  by  80%,  suggesting  a  mechanism 
by  which  accessibility  to  the  pore  may  be  regulated.  The  structural 
data  presented  here  indicate  that  the  GINS  tetramer  is  a  highly 
stable  complex  with  multiple  flexible  surface  regions. 

Cdc45  I  DNA  helicase  |  minichromosome  maintenance  complex  | 

DNA  polymerase 

Eukaryotic  DNA  replication  is  controlled  by  a  series  of 
ordered  and  regulated  steps  (1-3)  that  commence  with  the 
binding  of  the  six-subunit  origin  recognition  complex  (ORC)  to 
replication  origins.  During  the  G1  phase  of  the  cell  cycle,  Cdc6, 
and  Cdtl  are  recruited  to  the  origin,  and  together  with  ORC, 
support  the  loading  of  the  heterohexameric  MCM2-7  complex 
(minichromosome  maintenance,  MCM)  to  form  the  prereplica¬ 
tion  complex  (pre-RC).  Although  a  substantial  amount  of  data 
suggest  that  MCM  acts  as  the  replicative  helicase,  MCM  present 
in  the  pre-RC  (as  well  as  isolated  MCM)  is  devoid  of  helicase 
activity  (summarized  in  ref.  4).  At  the  Gi/S  transition  of  the  cell 
cycle,  it  appears  that  the  MCM  helicase  activity  is  activated  by 
a  complex  and  an  as  yet  poorly  understood  series  of  modifica¬ 
tions  that  require  the  action  of  two  protein  kinases,  DDK 
(Cdc7-Dbf4)  and  CDK  (cyclin-dependent),  as  well  as  the  par¬ 
ticipation  of  at  least  eight  additional  factors,  including  McmlO, 
Cdc45,  Dpbll,  GINS,  synthetic  lethal  with  dpbll  mutant-2 
(Sld2),  and  Sld3  (4).  Two  of  these  components,  Cdc45  and  GINS, 
appear  critical  for  helicase  activation  because  DNA  unwinding 
is  observed  (3,  5),  concomitant  with  their  loading  at  origins.  In 
accord  with  these  findings,  a  complex  containing  near- 
stoichiometric  levels  of  MCM,  Cdc45,  and  GINS  was  isolated 
imm  Drosophila  and  shown  to  possess  DNA  helicase  activity  (6). 
Studies  with  Xenopus  extracts  revealed  that  a  complex  that 
included  MCM,  Cdc45,  and  GINS  was  found  at  sites  at  which 
replication  forks  were  halted  artificially  by  a  streptavidin-biotin 
complex  (7).  In  Saccharomyces  cerevisiae,  GINS  was  shown  to 


play  a  critical  role  in  supporting  interactions  between  MCM  and 
Cdc45,  as  well  as  a  number  of  key  regulatory  proteins.  They 
together  formed  a  large  replisome  progression  complex  that 
migrated  with  the  replication  fork.  Upon  selective  degradation 
of  the  Psf2  subunit  of  GINS,  replication  was  halted  and  Cdc45 
was  no  longer  associated  with  MCM.  These  findings  suggest  that 
interactions  between  MCM  and  other  key  replication  factors 
might  be  mediated  by  GINS.  Collectively,  they  indicate  that 
GINS  is  an  essential  component  of  the  replicative  machinery  that 
moves  with  the  replication  fork. 

GINS  is  a  heterotetrameric  complex  consisting  of  Sld5,  Psfl 
(partner  of  Sld5-1),  Psf2,  and  Psf3  and  was  first  discovered  by 
using  a  variety  of  genetic  screens  in  S.  cerevisiae  (8).  The  four 
GINS  subunits  are  paralogs,  among  which  the  specific  subunit 
pairs  Psfl-Sld5  and  Psf2-Psf3  are  more  closely  related  (9).  Each 
of  the  subunits  is  relatively  small  («=^200  aa)  and  highly  conserved 
in  all  eukaryotes.  In  archaea,  only  two  homologues,  GinslS  and 
Gins23,  have  been  identified  that  appear  to  interact  and  form  a 
dimer  of  the  heterodimer,  suggesting  that,  like  its  eukaryotic 
counterpart,  it  is  a  tetramer  (10).  Direct  interactions  between  the 
archaeal  GINS  complex  and  the  archaeal  MCM,  as  well  as 
primase,  have  been  reported  (10).  Other  reports  have  recently 
appeared  suggesting  that  GINS  may  serve  as  an  accessory  factor 
for  eukaryotic  DNA  polymerases,  including  DNA  polymerase 
(Pol)  £  (11)  and  the  DNA  Pol  a-primase  complex  (12). 

Despite  the  essential  role  of  GINS  in  DNA  replication,  how 
GINS  interacts  with  MCM,  Cdc45,  and  other  protein  factors 
at  the  replication  fork  remains  unclear.  To  understand  the 
structural/functional  roles  of  GINS  in  replication,  we  crystallized 
the  human  GINS  complex  and  determined  its  crystal  structure. 
This  complex  included  the  full-length  proteins  of  each  of  the  four 
subunits.  During  the  preparation  of  this  work,  the  structure  of 
the  human  GINS  complex  containing  a  truncated  Psfl  subunit 
appeared  (13).  The  tetramer  structure  that  we  have  obtained  is 
basically  the  same  as  that  reported  by  Kamada  et  al  (13);  the 
folds  of  each  subunit  and  the  interactions  between  the  four 
subunits  are  essentially  the  same  with  slight  variations  found  for 
certain  loops  and  ]8-strands.  However,  our  crystal  structure 
revealed  certain  features  not  reported  for  the  structure  of  the 
truncated  complex  reported  by  Kamada  et  al  (13)  that  may  have 
important  functional  implications.  The  structural  and  muta- 
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Fig.1.  The  overall  structural  features  of  human  GINS  complex,  (a)  Superclex-200  gel-filtration  profile  of  the  human  GINS  complex  crystallized,  {b)  The  SDS/PAGE 
analysis  of  the  peak  fraction  from  a,  showing  that  all  four  subunits  are  present  as  full-length  proteins,  (c  and  of)  Two  views  of  the  wider  faces  of  the  GINS  complex 
structure,  showing  the  spindle-shaped  structure  and  part  of  the  central  pore  opening.  The  tetramer  is  composed  of  predominantly  a-helices  that  are  arranged 
in  a-helical  bundles,  (e  and  f)  Two  views  of  the  narrower  sides  of  the  GINS  complex.  The  four  subunits  are  shown  in  different  colors  as  indicated. 


tional  data  we  obtained  suggest  that  the  dimension  of  a  central 
pore  in  GINS  appears  to  be  regulated  by  a  short  N-terminal 
peptide  of  Psf3.  The  positions  of  disordered  regions  in  our 
structure,  including  the  C-terminal  51  residues  of  Psfl,  colocalize 
on  the  surface  of  the  GINS  complex  as  patches  and  likely  serve 
as  interaction  sites  for  the  binding  of  GINS  to  its  replication 
protein  partners. 

Results 

Overall  Structural  Features  of  the  GINS  Complex.  The  four  full- 
length  subunits  of  GINS  were  coexpressed  in  Escherichia  coli  and 
the  complex  purified  to  homogeneity.  The  isolated  complex  had 
an  apparent  molecular  mass  of  ^90  kDa  as  estimated  from  gel 
filtration  chromatography  (Fig.  \a)  and  glycerol  gradient  sedi¬ 
mentation  (data  not  presented)  consistent  with  a  molar 

ratio  of  the  four  different  proteins  in  the  tetrameric  complex 
(Fig.  \b).  We  crystallized  the  GINS  complex  as  described  in 
Materials  and  Methods,  and  SDS/PAGE  and  mass  spectrometric 
analyses  confirmed  that  all  four  proteins  present  in  the  crystals 
were  full-length.  We  determined  the  x-ray  structure  of  the  GINS 
complex  to  2.36  A  resolution  [statistics  presented  in  supporting 
information  (SI)  Table  1].  Each  asymmetric  unit  in  a  crystal  cell 
contained  two  GINS  heterotetramers  with  identical  conforma¬ 
tions.  The  overall  morphology  of  the  GINS  tetramer  complex 
resembles  a  slightly  elongated  spindle  (Fig.  1  c-f)  with  a  visible 
central  hole  (Fig.  1  c  and  d).  The  body  of  the  tetramer  is 
composed  of  a-helices  with  few  peripheral  short  j8-strands.  The 
gross  structural  features  are  essentially  the  same  as  those  re¬ 
cently  reported  for  the  structure  containing  a  truncated  Psfl 
subunit  (13);  in  the  reported  structure,  47  residues  of  the 
C-terminal  region  of  Psfl  had  to  be  deleted  for  crystallization. 
Surprisingly,  even  though  we  crystallized  GINS  with  all  full- 
length  proteins,  only  the  first  145  residues  of  Psfl  were  ordered. 
These  residues  were  present  in  the  structure  containing  trun¬ 
cated  Psfl  reported  by  Kamada  et  al.  (13).  The  C-terminal  51 


residues  of  Psfl  are  not  visible  in  our  structure,  indicating  that 
this  region  is  intrinsically  disordered. 

Structures  of  Individual  GINS  Subunits.  Each  of  the  four  subunits 
contains  a  major  domain  composed  of  a-helices  (a-domains.  Fig. 
2  a-d).  The  folds  of  the  a-domains  of  all  four  subunits  are 
similar;  each  contains  four  to  five  helices  arranged  more  or  less 
in  a  parallel  fashion  to  form  a  partial  three-helix  bundle  struc¬ 
ture.  In  three  of  the  four  subunits  (Sld5,  Psf2,  and  Psf3),  there 
is  a  small  j8-sheet  composed  of  three  antiparallel  ]8-strands  (]81, 
]82,  and  j83)  near  one  end  of  the  a-domain.  Around  the  j3-sheet 
are  two  helices  and  a  ]8-hairpin  or  loop,  forming  a  small  but 
definable  ]8-domain  in  these  three  subunits.  The  monomeric 
structures  overlap  well  with  the  reported  GINS  structure  (13), 
especially  within  the  a-domains.  However,  conformational  dif¬ 
ferences  are  noted  in  the  j8-domains  of  each  subunit  (Fig.  2  e-g). 
The  differences  in  the  j3-domains  and  in  a  few  loops  are  all 
located  on  the  surface  of  the  GINS  complex,  suggesting  a  certain 
degree  of  plasticity  in  the  surface  structures. 

Psfl  has  only  an  a-domain  (residues  1-145),  whereas  all  of  its 
C-terminal  51  residues  are  disordered.  There  is  substantial  space 
to  accommodate  the  C-terminal  51  residues  of  Psfl  in  at  least 
one  of  the  two  GINS  tetrameric  complexes  in  the  asymmetric 
unit.  Nonetheless,  no  definable  electron  density  can  be  seen  for 
the  C-terminal  51  residues  of  Psfl  (residues  146-196),  despite 
the  fact  that  sequence  alignment  suggests  a  fold  similar  to  the 
]8-domain  that  is  present  in  the  other  three  subunits  (SI  Fig.  la). 
Kamada  et  al.  (13)  reported  that  their  GINS  complex  crystallized 
only  when  a  Psfl  mutant  lacking  the  C-terminal  47  residues  (14) 
was  used,  suggesting  that  the  presence  of  this  ]8-domain  inhibited 
crystal  packing  of  the  GINS  complex.  Kamada  et  al.  (13) 
proposed  that  the  deleted  region  of  Psfl  folds  into  a  j3-domain 
structure  and  that  the  correct  positioning  of  this  domain  on  the 
surface  of  the  GINS  complex  is  critical  for  function.  However,  we 
crystallized  the  GINS  complex  with  all  full-length  proteins  under 
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Fig.  2.  The  structural  folds  of  individual  subunits,  (a-of)  The  detailed  structures  of  each  of  the  four  subunits.  N  and  C  indicate  the  position  of  the  N  and  C  termini 
in  each  subunit,  whereas  dashed  lines  indicate  the  location  of  disordered  fragments.  The  a-domains  (a-D)  and  /3-domains  (/3-D)  for  each  subunit  are  indicated. 
Labels  for  the  secondary  structures  in  the  a-domains  start  with  a  and  labels  for  the  jS-domains  start  with  jS.  (e-g)  A  comparison  of  the  jS-domains  of  our  structure 
(in  color)  and  that  reported  for  the  structure  determined  by  Kamada  etal.  (13),  showing  conformational  differences  in  Sld5  (e),  Psf2  (f),  and  Psf3  (g). 


different  crystallization  conditions  but  in  the  same  space  group 
with  similar  unit  cell  dimensions.  Our  structure  revealed  that  the 
C-terminal  51  residues  of  Psfl  are  not  folded  in  the  heterotet- 
rameric  GINS  complex  and,  thus,  are  not  anchored  to  the  GINS 
surface.  These  findings  suggest  that  the  Psfl  residues  proposed 
to  anchor  the  ]8-domain  on  the  GINS  surface  by  Kamada  et  al 
(13)  could,  instead,  play  a  role  in  binding  other  protein  partners. 

Even  though  the  GINS  subunits  have  similar  a-  and  j8-domains, 
the  relative  arrangements  of  the  two  domains  differ  among  the 
subunits,  as  predicted  in  ref.  9.  In  Sld5  and  Psfl,  the  larger 
N-terminal  portion  forms  the  a-domain,  and  the  smaller  C-terminal 
fragment  corresponds  to  the  j8-domain  (SI  Fig.  la).\n  contrast,  the 
order  of  a-  and  ]8-domains  of  Psf2  and  Psf3  is  reversed  (i.e.,  the 
j8-domain  is  at  the  N  terminus  and  the  a-domain  is  at  the  C 
terminus;  SI  Fig.  Ih).  In  addition,  the  space  between  the  a-  and 
j3-domains  for  Psf2/Psf3  is  only  6  residues,  but  is  21  residues  in  Sld5 
and  possibly  about  the  same  length  in  Psfl,  based  on  sequence 
alignment  (SI  Fig.  7  a  and  h).  Despite  the  differences  in  spacer 
length,  the  j3-domains  present  in  Sld5,  Psf2,  and  Psf3  appear 
anchored  to  their  respective  a-domains  through  direct  contacts 
(Fig.  2  a,  c,  and  d). 

Tetramer  Formation.  The  structure  of  the  GINS  tetramer  was 
reported  to  be  formed  by  an  arbitrarily  assigned  “vertical”  interface 
formed  through  the  a-domains  and  a  “horizontal”  interface  me¬ 
diated  through  the  j8-domains  (13).  Despite  the  similarities  of  our 
GINS  tetramer  structure  to  that  reported,  we  interpret  the  inter¬ 
actions  that  support  tetramerization  in  our  crystal  structure  some¬ 
what  differently,  as  described  below.  In  our  structure,  each  of  the 
four  subunits  interacts  with  two  other  molecules  to  form  a  ring-like 
structure  with  a  central  hole  (Fig.  3a).  The  four  subunits  are 
arranged  around  the  ring  in  the  order  Sld5,  Psfl,  Psf3,  and  Psf2, 
such  that  Psf2  contacts  Sld5  to  complete  the  circle.  Within  the  pore 
created  by  this  ring,  there  are  no  direct  bonding  contacts  between 


Sld5  and  Psf3  or  between  Psfl  and  Psf2.  The  intersubunit  interac¬ 
tions  are  mainly  through  the  sides  of  the  a-helical  bundles  to 
generate  extensive  contacts  between  subunits  with  buried  interface 
areas  between  two  neighboring  subunits  ranging  from  2,900  to 
4,000  A^.  These  large  interface  areas  presumably  provide  strong 
bonding  forces  at  the  subunit  interfaces,  which  explains  why  the 
tetrameric  complex  is  stable  in  solution.  Major  bonding  forces  are 
provided  through  helix- helix  interactions  between  adjacent  a- 
domains  of  different  subunits  (Sld5-Psfl,  Psfl-Psf3,  Psf3-Psf2,  and 
Psf2-Sld5),  mediated  mostly  by  hydrophobic  residues.  However, 
the  j8-domains  at  the  interface  between  Psf2-Sld5  also  play  a  role 
in  stabilizing  the  tetramer.  The  j82  of  the  Sld5  ]8-domain  interacts 
with  a  small  j3-strand  (q:s4)  from  the  Psf2  a-domain,  expanding  the 
Sld5  j8-domain  to  a  four-stranded  ]8-sheet  (Fig.  3h).  Additionally,  a 
]8-hairpin  from  the  Psf2  j8-domain  contacts  an  a-helix  of  the  Sld5 
a-domain  (Fig.  3c). 

Possible  Roles  of  the  Unstructured  Regions  of  Sld5,  Psfl,  and  Psf3. 

Although  our  crystal  structure  of  the  GINS  complex  included 
full-length  proteins  of  all  four  subunits,  portions  of  Sld5,  Psfl, 
and  Psf3  have  no  visible  electron  density  (represented  by  dashed 
lines  in  Fig.  2  a-d  and  SI  Fig.  7  a  and  h),  likely  due  to  the 
flexibility  of  these  regions.  Comparisons  between  our  structure 
and  that  reported  by  Kamada  et  al  (13)  reveal  that  the  disor¬ 
dered  regions  in  Sld5  and  Psf3  are  basically  the  same  in  both 
structures.  Because  our  GINS  constructs  and  crystallization 
conditions  differed  from  those  of  Kamada  et  al  (13),  the  fact  that 
similar  regions  are  missing  from  the  electron  density  maps  of 
both  structures  suggests  that  these  regions  are  highly  flexible  and 
unstructured  in  the  GINS  complex.  The  last  visible  C-terminal 
residue  of  Psfl  (SMS)  before  the  disordered  C-terminal  domain 
is  adjacent  to  the  disordered  fragment  of  Sld5  (residue  65-71) 
and  to  the  disordered  C-terminal  residues  of  Psf3  (residues 
194-216);  these  disordered  regions  are  shown  as  spheres  in  Fig. 
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Fig.  3.  The  ring  structure  of  the  GINS  complex  and  colocalization  of  disor¬ 
dered  regions  on  the  surface,  (a)  The  ring  structure  of  the  GINS  complex  can 
be  visualized  along  the  long  axis  (the  axis  of  the  a-helical  bundles)  of  the 
tetramer.  From  this  view,  it  is  clear  that  most  of  the  intersubunit  interactions 
are  mediated  through  a-helices.  The  central  opening  is  also  apparent,  {b  and 
c)  These  views  show  interactions  between  SIdS  and  Psf2,  which  involve 
/3-strands  at  two  locations  on  intersubunit  interfaces,  (of)  Colocalization  of  the 
disordered  regions  on  the  tetramer  surface.  Each  sphere  indicates  the  location 
of  the  residue  immediately  adjacent  to  the  disordered  fragment.  The  subunit 
(and  sphere)  color  scheme  is  the  same  as  that  used  in  a.  The  disordered 
fragments  from  three  subunits  colocalize  on  one  end  of  the  tetramer.  As 
shown,  the  disordered  region  of  Psf3  is  on  the  same  face  but  at  the  opposite 
end.  These  disordered  regions  may  become  structured  when  bound  to  inter¬ 
acting  partners. 

3d.  The  colocalization  of  these  disordered  parts  of  three  differ¬ 
ent  subunits  on  the  GINS  surface  suggests  that  this  site  may  bind 
partner  proteins  in  the  replication  complex.  On  the  same  side, 
but  located  at  the  other  end  of  the  tetramer,  is  the  disordered 
region  within  the  j8-domain  of  Psf3  (Fig.  3d),  which  may  also 
serve  as  a  protein-binding  site.  These  disordered  regions  located 
on  the  surface  of  GINS  may  become  structured  upon  binding  to 
proteins  known  to  interact  with  GINS,  such  as  MCM,  Cdc45,  and 
DNA  polymerases.  Most  of  these  are  large  proteins  or  complexes 
and  the  location  of  disordered  sites  on  opposite  sides  of  the 
tetramer  may  allow  binding  of  more  than  one  of  these  factors  at 
the  same  time.  In  keeping  with  this  idea,  previous  studies  on 
Xenopus  reported  that  the  chromatin  loading  of  Cdc45  and  GINS 
were  mutually  dependent  (15).  Furthermore,  Kamada  et  al  (13) 
observed  that  the  human  GINS  complex  containing  the  Psfl 
subunit  with  the  flexible  C-terminal  region  truncated  (labeled 
Psfl  C  in  Fig.  3d)  failed  to  bind  to  Wit  Xenopus  pre-RC  and  failed 
to  load  Cdc45.  These  findings  support  our  notion  that  the 
flexible  regions  present  on  the  GINS  surface  are  important  for 
its  binding  to  replicative  proteins. 

Accessibility  of  the  Central  Pore.  When  visualized  by  negatively 
stained  EM,  the  recombinant  Xenopus  GINS  complex  has  a 
ring-like  structure  with  a  central  hole  (15).  Kamada  et  al  (13) 
suggested  that  the  ring-like  shape  observed  by  EM  was  an  artifact 
resulting  from  low-density  distribution  of  electrons  at  the  center 
of  the  complex.  Examination  of  our  GINS  structure,  however. 
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Fig.  4.  The  structure  of  the  central  pore  of  GINS  and  its  accessibility,  (a)  A 
view  of  the  central  pore  of  the  GINS  tetramer.  The  red  loop  inside  the  pore  is 
formed  by  the  N-terminal  16  residues  of  Psf3.  The  loop  is  not  tightly  bonded 
to  the  pore  surface,  (b)  The  surface  representation  of  the  view  in  a.  (c)  The 
same  view  as  in  a  but  without  the  N-terminal  residues  of  Psf3.  (c/)  The  surface 
representation  of  the  view  shown  in  c. 

revealed  that  it  does  have  ring-like  structure  when  viewed  along 
the  central  pore,  as  shown  in  Fig.  Aa.  Based  on  EM  measure¬ 
ments,  the  ring  diameter  was  reported  to  be  ^9.5  nm  (95  A),  with 
the  central  pore  ^A  nm  (40  A)  in  diameter  (15);  in  contrast,  the 
diameter  of  the  ring  in  our  crystal  structure  is,  at  maximum,  «=^78 
A  (measured  from  ed^e  to  edge  in  the  large  dimension)  and  the 
central  pore  is  10  A  in  diameter  (Fig.  4  a  and  b).  These 
quantitative  differences  in  dimensions  could  be  due  to  flattening 
(deformation)  and  dehydration  effects  intrinsic  to  negative 
staining  methods  used  to  prepare  EM  samples. 

Detailed  examination  of  the  central  pore  present  in  our  GINS 
crystal  structure  revealed  that  a  16-residue  peptide  loop  from  the 
N  terminus  of  Psf3  appeared  to  fit  loosely  into  the  pore, 
effectively  restricting  the  opening  (Fig.  4  a  and  b,  red).  Inter¬ 
actions  of  this  peptide  loop  with  the  surface  of  the  pore  are 
limited,  suggesting  that  the  peptide  may  enter  and  leave  the  pore 
without  a  significant  energy  barrier.  Interestingly,  sequence 
alignment  of  this  16-aa  sequence  at  the  N  terminus  of  Psf3 
revealed  that  it  is  present  in  human  and  some  higher  eukaryotes 
but  not  in  many  other  organisms  (SI  Fig.  Sa).  This  sequence  data 
suggest  that  the  first  16  N-terminal  residues  of  Psf3  may  not  be 
needed  for  the  structural  integrity  of  the  GINS  tetramer.  A 
speculative  role  of  this  16-residue  peptide  maybe  to  regulate  the 
dimensions  of  the  central  pore,  and  thus  its  accessibility,  by 
plugging  and  unplugging  this  cavity. 

To  test  the  notion  that  this  N-terminal  16-residue  peptide  is 
not  required  for  tetramer  formation  and  stability  of  the  GINS 
complex,  we  generated  Psf3  constructs  lacking  either  10  or  18 
residues  of  the  N  terminus  and  examined  whether  such  con¬ 
structs  supported  tetramer  formation.  Both  Psf3  constructs 
formed  stable  tetramers  under  high  and  low  salt  conditions  and 
behaved  similarly  to  the  GINS  complex  containing  full-length 
Psf3  in  gel  filtration  analyses  (SI  Fig.  86).  This  finding  indicates 
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Fig.  5.  The  mapping  of  the  yeast  ts  mutants  on  the  equivalent  sites  present 
in  human  GINS  structure,  (a  and  b)  The  class  I  ts  mutant  residues  involved  in  the 
intersubunit  interactions  between  Psf  1  R74  and  Psf3  (a)  and  between  Psf3  L72 
and  Psfl  (b).  (c-e)  The  class  II  ts  mutant  residues  Psf3  A73,  SIdS  L222,  and  Psf2 
R124  are  involved  in  intrasubunit  interactions.  Mutation  of  these  residues,  as 
indicated,  would  be  expected  to  affect  the  folding  and  stability  of  individual 
subunits  within  the  GINS  complex. 


that  the  N-terminal  16-residue  loop  of  Psf3  is  not  needed  for 
folding  and  stability  of  the  GINS  complex. 

The  diameter  of  the  central  pore  is  A  at  its  narrowest 
point,  but  the  opening  is  increased  to  18  A  upon  removal  of  the 
N-terminal  16-residue  loop  of  Psf3  (Fig.  4  c  and  d).  This  finding 
may  also  partially  explain  the  larger  pore  size  observed  in  EM 
(15)  because  the  acidic  negative  staining  solution  used  for  EM 
may  dislodge  the  N-terminal  peptide  of  Psf3  from  the  pore. 
Thus,  the  16-residue  peptide  at  the  N  terminus  of  human  Psf 3 
might  regulate  pore  opening  and  closing.  Although  a  functional 
role  for  the  central  pore  is  as  yet  unknown,  we  speculate  that  one 
possible  role  is  to  bind  and  hold  a  fragment  from  MCM,  Cdc45, 
or  DNA  polymerases  during  DNA  synthesis.  It  is  also  possible 
that  the  pore  could  bind  ssDNA  because  its  dimension  is 
sufficiently  large  to  accommodate  ssDNA  even  with  the  N- 
terminal  peptide  of  Psf 3  situated  within  the  pore.  In  support  of 
this  possibility,  there  are  10  Arg  and  Lys  residues,  as  well  as 
several  Asn  and  Gin  amino  acids  distributed  on  the  surface  of  the 
pore  (SI  Fig.  9),  despite  the  overall  negatively  charged  outer 
surface  of  the  tetramer.  This  charge  distribution  is  similar  to  that 
of  the  proliferating  cell  nuclear  antigen  (PCNA)  and  T7  gp4 
helicase,  both  of  which  have  positively  charged  and  polar  resi¬ 
dues  around  the  central  openings  even  though  they  have  an 
overall  negatively  charged  outer  surface  (14,  16).  The  GINS 
complex  might  require  an  additional  factor(s)  to  bind  ssDNA 
because  DNA-binding  activity  has  not  as  yet  been  detected  for 
the  GINS  complex  isolated  from  human  (data  not  shown)  or 
other  organisms  (6,  11). 

Understanding  Temperature-Sensitive  (ts)  Mutants  of  GINS.  A  total  of 
nine  different  ts  mutations  have  been  identified  in  the  four  yeast 
GINS  proteins  (8,  17,  18).  These  mutated  residues  are  high¬ 
lighted  in  the  sequence  alignments  shown  in  SI  Fig.  7  a  and  b, 
eight  of  which  are  highly  conserved  across  eukaryotes.  Our 
GINS  structure  provides  a  rationale  for  all  of  these  ts  mutations. 
The  nine  ts  mutations  can  be  divided  into  three  classes  according 
to  their  possible  defect(s)  at  nonpermissive  temperatures.  The 


Fig.  6.  A  model  for  the  GINS  complex  coordinating  MCM,  Cdc45,  Pol  s,  and 
Pol  a-primase  complex  at  the  replication  fork.  The  GINS  complex  is  shown  in 
surface  representation.  All  other  components  (drawn  as  cartoons)  are  shown 
as  bound  directly  to  the  GINS  complex. 


mutants  in  class  I  involve  residues  important  for  intersubunit 
interactions.  Psfl-R74G  (Fig.  5a)  and  Psf3-L72P  (Fig.  5b)  belong 
to  this  class.  Psfl  R74  (yeast  Psfl-1  R84G)  forms  hydrogen 
bonds  with  Psf3  E17  and  S28  (Fig.  5a);  a  Gly  substitution  at  this 
position  should  abolish  the  two  hydrogen  bonds  and  significantly 
weaken  interactions  between  Psfl  and  Psf3,  which  may  lead  to 
instability  of  the  complex  at  nonpermissive  temperatures.  Sim¬ 
ilarly,  the  Psf3  L72P  mutation  (yeast  Psf3-21  L46P)  should 
reduce  the  hydrophobic  packing  with  Psfl  F64  (Fig.  5b).  The  ts 
mutants  in  class  II  involve  residues  important  for  intrasubunit 
interactions.  Psf3  A73  (yeast  Psf3-21  A47P)  and  Sld5  L222  (yeast 
Sld5-13  L293P)  are  both  involved  in  j8-domain  packing,  as  well 
as  a-  and  j8-domain  interactions  (Fig.  5  c  and  d).  Psf2  R124  (yeast 
Psf2-209  R133K)  forms  five  hydrogen  bonds  within  three  helices 
and  two  loops  in  its  a-domain  (Fig.  5e).  Class  III  ts  mutants 
involve  residues  buried  in  the  hydrophobic  core  structures  of 
Sld5  and  Psf 3.  Mutations  of  these  buried  residues  would  be 
expected  to  reduce  the  thermodynamic  stability  of  the  structure, 
providing  a  plausible  molecular  explanation  for  the  phenotypes 
of  these  mutants.  Surprisingly,  ts  mutations  have  not  as  yet  been 
identified  on  the  surface  of  the  GINS.  Because  GINS  interacts 
directly  with  the  MCM  complex,  Cdc45,  and  other  replicative 
protein  in  different  species  (4),  it  is  likely  that  the  binding  sites 
for  these  proteins  are  conserved.  We  speculate  that  mutations  on 
the  surface  of  GINS  that  result  in  temperature  sensitivity  may 
help  identify  their  binding  sites. 

Structural  Conservation  of  GINS  in  Archaea.  The  archaeon  Sulfolo- 
bus  solfataricus  has  two  GINS  homologs,  GinslS  and  Gins23 
(10).  Sequence  analysis  indicates  that  eukaryotic  Psfl/Sld5  and 
Psf2/Psf3  are  close  paralogs  of  GinslS  and  Gins23,  respectively 
(9),  suggesting  that  these  two  GINS  proteins  can  form  a  tet- 
rameric  complex  through  the  dimerization  of  the  heterodimer. 
The  two  independently  derived  crystal  structures  of  human 
GINS  reveal  that  the  residues  involved  in  the  interfaces  between 
subunits  are  conserved  among  archaeal  and  eukaryotic  GINS 
proteins  (data  not  shown),  providing  further  support  for  a 
structural  and  functional  conservation  between  archaeal  and 
eukaryotic  GINS  complexes.  In  keeping  with  these  consider¬ 
ations,  it  was  shown  that  the  GINS  complex  formed  in  the 
archaeon  is  a  tetramer  formed  through  the  dimerization  of  the 
Ginsl5-Gins23  heterodimer  (10). 

A  Model  for  GINS  Role  at  the  Replication  Fork.  A  speculative  model 
of  how  the  human  GINS  tetrameric  complex  interacts  and 
coordinates  the  activities  of  its  binding  partners  is  proposed  in 
Fig.  6.  In  this  model,  we  suggest  a  direct  contact  between  Psfl 
and  Pol  8  because  an  interaction  between  S.  cerevisiae  Psfl  and 
Dpb2  (the  second  largest  subunit  of  Pol  s)  was  detected  in  a  yeast 
two-hybrid  screen  (8).  Because  archaeal  Gins23  binds  to  the  N 
terminus  of  MCM  (10)  and  an  interaction  of  Psf 3  with  MCM  has 
been  detected  in  the  yeast  two-hybrid  system  (S.  Azuma  and  H. 
Masukata,  unpublished  data  in  ref.  18),  our  model  depicts  Psf 3 
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in  contact  with  MCM.  We  assume  that  Psf2  contacts  the  Pol  a- 
primase  complex  based  on  the  report  that  archaeal  Gins23 
interacts  with  primase  (10).  The  model  also  allows  MCM,  Cdc45, 
and  GINS  to  contact  each  other  based  on  the  isolation  of  this 
complex  from  Drosophila  eggs  (6).  Finally,  we  also  suggest  that 
GINS  must  interact  with  Pol  s  and  the  Pol  a-primase  complex 
to  coordinate  leading-  and  lagging-strand  synthesis,  respectively 
(11,  12). 

In  summary,  the  structural  and  biochemical  data  presented 
both  here  and  by  Kamada  et  al  (13)  suggest  that  GINS  functions 
as  a  tight  heterotetrameric  complex.  The  molecular  interactions 
between  the  subunits  of  GINS  are  mediated  mostly  through 
helix- helix  interactions  that  amplify  the  helix-bundle-like  struc¬ 
ture  of  each  individual  subunit.  In  agreement  with  an  EM  study 
of  Xh^Xenopus  GINS  complex  (15),  we  find  an  open  pore  along 
the  long  axis  of  the  ring-like  tetramer  structure.  The  pore  size 
appears  to  be  influenced  by  the  position  of  the  N-terminal  16-20 
aa  residues  of  Psf3,  providing  a  possible  mechanism  for  regu¬ 
lating  the  accessibility  of  the  central  opening  for  its  binding  to  a 
peptide  from  an  interacting  factor  or  to  ssDNA.  The  conserva¬ 
tion  of  the  GINS  sequence  among  archaea  and  eukaryotes 
suggests  a  fundamental  functional  role  for  this  complex  in  DNA 
replication;  it  is  likely  that  GINS  serves  as  a  scaffold  for  the 
assembly  and  maintenance  of  an  active  helicase  and  replication 
complex  at  the  fork.  This  structure  should  provide  a  framework 
for  future  studies  directed  at  how  the  GINS  complex  interacts 
with  other  replication  proteins  that  jointly  support  both  the 
initiation  of  DNA  synthesis  as  well  as  fork  progression. 

Materials  and  Methods 

Cloning  and  Protein  Expression,  and  Purification  GINS.  hSldS  (Gen- 
Bank  accession  no.  NM_032336),  hPsf2  (GenBank  accession  no. 
BC062444),  and  hPsf3  (GenBank  accession  no.  BC005879) 
cDNAs  were  amplified  from  a  HeLa  cDNA  library.  hPsfl 
(GenBank  accession  no.  BC012542)  cDNA  was  derived  from 
Integrated  Molecular  Analysis  of  Genomes  and  their  Expression 
(IMAGE)  clone  4333095.  All  of  these  cDNAs  were  subcloned 
into  pGEX6P-l  (GE  Healthcare,  Chalfont  St.  Giles,  U.K.)  in  the 
order  GST-Psf2,  Sld5,  Psfl,  and  His-8-Psf3  to  produce  a  poly- 
cistronic  mRNA.  The  N-terminal  deletion  mutants  of  Psf3 
(N-terminal  10  and  18  residues  deleted,  respectively)  were 
constructed  as  a  C-terminal  His-8  fusion  (Psf3-His-8).  The  four 
subunits  of  human  GINS  were  coexpressed  in  E.  coli  cells  by 
isopropyl  ]8-D-thiogalactoside  (Sigma-Aldrich,  St.  Louis,  MO) 
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mM  Tris-HCl  (pH  8.0),  250  mM  NaCl  (buffer  A),  and  5  mM 
2-mercaptoethanol  (Sigma-Aldrich).  The  GST  and  His-8  tags 
were  subsequently  removed  by  PreScission  protease  treatment  in 
buffer  A  containing  1  mM  DTT  (Sigma-Aldrich).  The  GINS 
complex  with  molar  stoichiometry  was  obtained  by  using 

Resource  Q  ion-exchange  chromatography  with  a  50-500  mM 
NaCl  gradient  elution  and  gel-filtration  chromatography 
through  a  Superdex-200  column  (GE  Healthcare)  in  buffer 
containing  50  mM  Tris-HCl  (pH  8.0)  and  50  mM  NaCl.  The 
typical  yield  from  24-liter  culture  was  «=^30  mg. 

Crystallization  and  Structure  Determination.  Crystals  were  grown  by 
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The  minichromosome  maintenance  (MCM)  complex  plays  essential,  conserved  roles  throughout  DNA 
synthesis:  first,  as  a  component  of  the  prereplication  complex  at  origins  and,  then,  as  a  helicase  associated  with 
replication  forks.  Here  we  use  fission  yeast  (Schizosaccharomyces  pombe)  as  a  model  to  demonstrate  a  role  for 
the  MCM  complex  in  protecting  replication  fork  structure  and  promoting  recovery  from  replication  arrest. 

Loss  of  MCM  function  generates  lethal  double-strand  breaks  at  sites  of  DNA  synthesis  during  replication 
elongation,  suggesting  replication  fork  collapse.  MCM  function  also  maintains  the  stability  of  forks  stalled  by 
hydroxyurea  that  activate  the  replication  checkpoint.  In  cells  where  the  checkpoint  is  activated,  Mcm4  binds 
the  Cdsl  kinase  and  undergoes  Cdsl-dependent  phosphorylation.  MCM  proteins  also  interact  with  proteins 
involved  in  homologous  recombination,  which  promotes  recovery  from  arrest  by  ensuring  normal  mitosis.  We 
suggest  that  the  MCM  complex  links  replication  fork  stabilization  with  checkpoint  arrest  and  recovery  through 
direct  interactions  with  checkpoint  and  recombination  proteins  and  that  this  role  in  S-phase  genome  stability 
is  conserved  from  yeast  to  human  cells. 


Replicative  stress  is  a  major  source  of  mutations  that  con¬ 
tribute  to  chromosome  instability  and  the  initiation  of  cancer 
(5,  26).  A  primary  defense  against  genome  instability  is  S- 
phase  checkpoints,  which  recognize  DNA  damage,  arrest  the 
cell  cycle,  and  preserve  essential  replication  structures  (10,  38, 
62).  In  response  to  S-phase  agents  such  as  the  ribonucleotide 
reductase  inhibitor  hydroxyurea  (HU),  replication  forks  stall 
and  activate  checkpoint  kinases  (ATR-Chkl  in  mammals  and 
Rad3-Cdsl  and  Mecl-Rad53  in  the  yeasts  Saccharomyces 
pombe  and  Saccharomyces  cerevisiae,  respectively)  (10,  38,  62). 
The  primary  function  of  this  replication  checkpoint  is  to  retain 
the  assembled  replisome  at  stalled  forks  during  arrest:  check¬ 
point  defects  lead  to  catastrophic  replication  fork  collapse  and 
cell  lethality.  However,  recent  studies  have  shown  that  proteins 
directly  involved  in  DNA  replication  as  well  as  recombination 
are  required  for  cells  to  repair  DNA  damage,  restart  replica¬ 
tion,  and  recover  from  arrest  and  must  interact  with  the  check¬ 
point  to  facilitate  these  functions  (10,  38,  62). 

The  minichromosome  maintenance  (MCM)  complex  is  an 
essential  replicative  helicase  that  consists  of  six  related  sub¬ 
units  (MCM2  to  MCM7)  and  is  required  for  the  initiation  and 
elongation  phases  of  DNA  synthesis  (reviewed  in  reference 
22).  The  MCM  helicase  is  therefore  centrally  positioned  to 
monitor  and  maintain  genome  stability  at  the  replication  fork. 
In  budding  yeast,  the  use  of  degron  alleles  of  MCM  subunits. 
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which  degrade  the  protein  upon  a  shift  to  high  temperature, 
has  demonstrated  that  MCM  function  is  required  for  fork 
progression  throughout  S  phase  as  well  as  for  synthesis  to 
resume  after  forks  have  stalled  in  HU  (36).  In  cells  lacking  the 
Rad53  protein  kinase  or  its  S-phase  activator  Mrcl,  treatment 
with  HU  results  in  excessive  DNA  unwinding,  catastrophic 
replication  fork  collapse,  and  an  inability  to  restart  synthesis 
(12,  15,  21,  55,  73).  Under  these  conditions,  MCM  proteins 
remain  chromatin  bound,  but  the  leading  and  lagging  strands 
of  the  replication  fork  become  uncoupled  (12,  34,  55).  In  meta¬ 
zoans,  MCM  subunits  are  phosphorylated  by  the  ATM/ATR 
kinases  (17,  33,  67,  82),  suggesting  that  the  MCM  complex  may 
be  a  target  or  an  effector  of  the  replication  checkpoint. 

However,  MCM  proteins  may  also  promote  S-phase  genome 
stability  through  checkpoint-independent  roles  (reviewed  in 
references  3  and  22).  Although  cellular  levels  of  MCM  pro¬ 
teins  are  estimated  as  10-  to  40-fold  excess  over  the  number  of 
replication  origins,  yeast  temperature-sensitive  mem  mutants 
(mcm-ts)  that  partially  reduce  MCM  protein  levels  exhibit  in¬ 
creased  recombination,  chromosome  loss,  and  checkpoint  sen¬ 
sitivity  (32,  40,  41).  In  addition,  increased  expression  or  am¬ 
plification  of  MCM  genes  is  associated  with  many  types  of 
human  cancers  that  are  characterized  by  genomic  instability 
(reviewed  in  reference  39)  and  the  mutation  or  the  dysregula- 
tion  of  MCM  subunits  can  induce  skin  carcinoma  and  mam¬ 
mary  carcinoma  in  mouse  models  (30,  68).  These  results  sug¬ 
gest  that  the  MCM  complex  plays  a  central  role  in  protecting 
genome  stability  during  S  phase. 

The  stabilization  of  replisome  structure  and  its  recovery 
from  replication  arrest  also  depend  upon  proteins  involved  in 
homologous  recombination  (HR).  In  budding  yeast,  the  RadSl 
strand  exchange  enzyme  and  the  Sgsl  recombination  helicase 
are  required  to  maintain  DNA  polymerase  8  at  stalled  repli¬ 
cation  forks  (16).  Sgsl  is  additionally  involved  in  checkpoint 
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activation  (6).  In  fission  yeast,  the  SpRad22  (ScRad52)  recom¬ 
bination  protein  is  recruited  to  nuclear  foci  immediately  as 
cells  are  released  from  HU  arrest  (48).  Although  fission  yeast 
rad22  and  rhp51  (radSl)  mutants  are  proficient  for  activation 
of  the  replication  checkpoint,  they  still  show  sensitivity  to  HU 
(48,  84).  Together,  these  results  suggest  that  components  of 
the  replisome  interact  with  recombination  proteins  to  maintain 
and  restart  replication  forks. 

Previously,  we  and  others  have  shown  that  most  fission  yeast 
mcm-ts  mutants  accumulate  approximately  2C  DNA  content, 
but  undergo  a  lethal,  checkpoint-dependent  arrest  in  late  S 
phase  or  G2  (18,  23,  40,  41).  In  contrast,  a  degron  allele  of 
mcm4ts  blocks  cells  at  or  near  replication  initiation  (42).  Here, 
we  use  these  different  mcm  alleles  to  examine  the  conse¬ 
quences  to  cells  of  inactivating  MCM  function  during  S  phase 
in  the  absence  or  the  presence  of  forks  stalled  by  HU.  We 
demonstrate  that  the  loss  of  MCM  function  generates  DNA 
breaks,  cell  cycle  arrest,  and  a  loss  of  viability  similar  to  that 
observed  in  mutants  that  undergo  replication  fork  collapse.  Con¬ 
sistent  with  these  results,  we  find  that  Mcm4  interacts  with  the 
checkpoint  protein  kinase  Cdsl  and  undergoes  Cdsl -dependent 
phosphorylation  in  cells  treated  with  HU.  This  result  suggests  that 
MCM  proteins  act  to  maintain  replication  fork  structure  both 
during  normal  S  phase  and  during  S-phase  arrest. 

Our  data  additionally  suggest  that  MCM  function  is  re¬ 
quired  for  proper  recovery  from  replication  arrest  induced  by 
HU.  We  observed  an  interaction  between  the  MCM  complex 
and  the  HR  protein  RhpSl  (RadSl).  Although  RhpSl  and 
other  HR  proteins  are  not  required  to  activate  the  replication 
checkpoint,  to  maintain  fork  structure  in  HU,  or  to  restart 
DNA  synthesis,  we  find  that  the  loss  of  HR  function  results  in 
chromosome  missegregation  following  release  from  HU-in- 
duced  replication  arrest  (this  work;  48).  We  suggest  that  MCM 
proteins  modulate  replication  fork  progression,  arrest,  and  re¬ 
start  and  that  they  couple  these  functions  with  the  repair  of 
DNA  damage  to  protect  S-phase  genome  stability.  Our  anal¬ 
ysis  in  fission  yeast  is  complemented  by  studies  of  mammalian 
cells,  suggesting  that  the  role  of  the  MCM  complex  in  S-phase 
genome  stability  is  conserved. 

MATERIALS  AND  METHODS 

Yeast  strains  and  media.  S.  pombe  strains  (Table  1)  were  constructed  and 
maintained  according  to  standard  procedures  (53).  Cells  were  treated  with  15  or 
20  mM  HU  (Sigma)  for  the  indicated  times  or  with  0.0025%  methyl  methane- 
sulfate  (Sigma)  or  5  ixU/ml  bleomycin  (Sigma)  for  1  h  or  were  irradiated  with  100 
Gy  using  a  ^‘^cobalt  source.  Asynchronous  cultures  or  cultures  synchronized  by 
nitrogen  starvation  (53)  were  shifted  to  the  restrictive  temperature  (36°C)  for 
4  h;  cells  pulse-labeled  with  bromodeoxyuridine  (BrdU)  were  shifted  to  the 
restrictive  temperature  for  3.5  h  and  then  incubated  with  200  |JLg/ml  BrdU 
(Sigma)  for  an  additional  30  min  at  36°C. 

Cell  culture  and  short  interfering  RNA  (siRNA).  HeLa  cells  were  grown  in 
Dulbecco’s  modified  Eagle’s  medium  containing  10%  calf  serum  (Invitrogen), 
with  appropriate  antibiotics.  HeLa  cells  were  synchronized  in  Gi/S  with  2  mM 
thymidine  (Sigma)  for  18  h  and  then  released  into  S  phase  for  3  to  4  h  in  the 
absence  of  added  DNA  damage.  Cells  were  blocked  in  early  S  phase  with  2.5  mM 
HU  for  18  h. 

For  siRNA  experiments,  synthetic  RNA  duplexes  were  transfected  into  HeLa 
cells  to  a  final  concentration  of  20  nM  using  Oligofectamine  (Invitrogen)  ac¬ 
cording  to  the  protocol  provided  by  Dharmacon.  siRNA  duplexes  used  were 
CGACAGCTAGAGTCATTAA  for  MCM4  and  ATCGGATTGTGAAGA 
TGAA  for  MCM7,  synthesized  by  Integrated  DNA  Technologies,  Inc.  Pools  of 
four  additional  siRNA  duplexes  against  MCM4  or  MCM7  were  obtained  from 
Dharmacon.  The  siRNA  negative  control  duplex  was  from  Invitrogen.  Relative 
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TABLE  1.  Yeast  strains  used  in  this  study 


Strain 

Genotype 

Source 

FY7 

h-  972 

Our  stock 

FY72 

h~  ade6-M210  ura4-D18  leul-32 

Our  stock 

FY243 

h~  cdcl9-Pl  ade6-M210  ura4-D18  leul-32 

Our  stock 

FY261 

ura4-D18  leul-32  ade6-M216  canl-1 

Our  stock 

FY364 

h~  cdc21-M68  leul-32  ura4-D18  ade6-M210 
canl  -1 

Our  stock 

FY522 

h~  cdc22-M45  ade6-M210  leul-32  ura4-D18 

Our  stock 

FY641 

h~  cdc6-23  ura4-D18  leul-32  ade6-M210 

Our  stock 

FY865 

h~  \cdsl\\ura4^  ura4-D18  leul-32 

Our  stock 

FY866 

Acdsl::ura4-D18  leul-32 

Our  stock 

FY961 

mis 5 -268  ura4-D18  leul-32  ade6-M210 
canl  -1 

Our  stock 

FY969 

h~  cdc27-K3  ura4-D18  leul  ade6-M210 

Our  stock 

FY986 

hskl-1312  ura4-D18  leul-32  ade6-M216 

Our  stock 

FY999 

hskl-1312  Acdsl::ura4^  ura4-D18  leul-32 

Our  stock 

FY1106 

^rad3\\ura4^  ura4-D18  leul-32 
ade6-M210 

Our  stock 

FY1167 

h~  mcm4-HA::leul^  ura4-D18  leul-32  ade6- 
M210 

Our  stock 

FY1201 

h~  mcm7-98  ura4-D18  leul-32  ade6-M210 

Our  stock 

FY1209 

Arad22\\[hisG  ura4^  hisG]  ade6-M210 
ura4-D18  leul-32 

A.  Pastink 

FY1313 

h~  cdcl7-K42  leul-32  ade6-M210  canl-1 

Our  stock 

FY1374 

h~  cdcl9-Pl  Arad22\\[hisG  ura4^  hisG^ 
ade6-M210  leul-32  ura4-D18 

This  study 

FY1375 

cdc21-M68  ^rad22\\[hisG  ura4'^  hisG^ 
ade6-M210  leul-32  ura4-D18 

This  study 

FY1390 

h~  smt-0  Arhp51::ura4  ura4-D18 

Our  stock 

FY1413 

h~  cdc21-M68 

Our  stock 

FY1507 

smt-0  Arhp51\\ura4^  cdc21-M68 
ade6-M210  ura4-D18 

This  study 

FY1617 

h~  cdc21-M68  Acdsl::ura4^  ura4  leul 
ade6-M210 

Our  stock 

FY1884 

smt-0  Arhp51\\ura4^  ade6-M210  leul-32 
ura4-D18 

This  study 

FY1963 

h~  cdc21-M68  chkl\\ura4'^  ura4-D18 

Our  stock 

FY2234 

Atell::ura4^  ade6-210  ura4-D18  leul-32 

Our  stock 

FY2317 

leul-32::hENTl-leul^{pJAm9) 
his 7-366 : \hsv-tk-his  7^  (p JAH3 1 )  ura4-Dl 8 
ade6-M210 

31 

FY2356 

h~  smt-0  Arhp51:\ura4^  cdc21-HA::leul^ 
ade6-M210  leul-32  ura4-D18 

This  study 

FY2423 

h^  cdc21-HA\\leul^  cdsl::ura4^  ade6-M210 
leul-32  ura4-D18 

Our  stock 

FY2514 

h^  leul-32\\hENTl-leul^ {pi AY\29) 
his7-366\\hsv-tk-his7^ {pi AH31)  cdc21-M68 
leul-32  ura4-D18  ade6-M210 

This  study 

FY2728 

h^  Arad3\\ura4^  Atell\\ura4^  ade6-M210 
leul-32  ura4-D18 

This  study 

FY2878 

h-  rad22:YEP:kanMX4  ade6-M210  ura4-D18 
leul-32 

G.  Freyer 

FY2887 

h^  mcm4  {cdc21-M68)-ts-td::ura4^  ade6- 
M210  leul-32  ura4-D18 

This  study 

FY2906 

h^  rad22-YEP:kanMX4  cdc21-M68  leul-32 
ura4-D18 

This  study 

FY3201 

h  ~  cdcl  9 -PI  Arad32 :  \kanmx  ade6-M21 0 
leul-32  ura4-D18 

This  study 

FY3203 

h^  cdcl9-Pl  Arhp51::ura4'^  ade6-M210 
leul-32  ura4-D18 

This  study 

FY3204 

h^  cdc21-M68  Arad32\\kannvc  ade6-M210 
leul-32  ura4-D18 

This  study 

FY3396 

h^  cdc21-M68-ts-dg\\ura4^  ura4-D18 

This  study 

FY3650 

h^  Acdslv.ura4  Arhp51\\ura4  ura4-D18 
leul-32 

This  study 

FY3651 

h~  smt-0  Acdsl::ura4  Arhp51::ura4  ura4-D18 

This  study 

JBY235 

h~  mcm4-HA::leul^  cdsl-13myc::kanmx 
leul-32  ura4-D18 

This  study 

JBY267 

h  ~  cdsl  -1 3myc : \kanMX  ade6-M21 0  leul  -32 
ura4-D18 

This  study 
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viability  was  determined  as  the  percentage  of  viable  cells  at  48  h  compared  to  the 
percentage  of  viable  cells  at  the  time  of  transfection  (0  h). 

Immunofluorescence.  Spread  nuclei  were  prepared  from  fission  yeast  cells 
according  to  previous  procedures  (29).  Briefly,  cells  were  spheroplasted  in  phos¬ 
phate-buffered  saline  (PBS)  with  0.5  mg/ml  zymolyase  20T  (Seikagaku)  and  0.5 
mg/ml  lysing  enzymes  (Sigma).  Cells  were  then  washed  in  MES  [(2-A-morpho- 
lino)ethane  sulfonic  acid]  buffer  (0.1  M  MES,  1  M  sorbitol,  1  mM  EDTA,  0.5 
mM  MgCl2,  pH  6.4),  resuspended  in  100  |jl1  MES  plus  360  |jl1  4%  paraformal¬ 
dehyde,  and  poured  onto  Colorfrost  microscope  slides  (Thermo  Fisher  Scien¬ 
tific).  Slides  were  rinsed  with  0.4%  Photo-Flo  (Kodak)  and  fixed  with  low  heat. 

S.  pombe  chromatin  fibers  were  prepared  by  a  modification  of  a  protocol  used 
to  visualize  yeast  artificial  chromosome  chromatin  fibers.  The  protocol  generates 
fibers  of  approximately  1.8  kb/ixm  (65).  Cells  were  spheroplasted  in  zymolyase 
mix  (1  M  sorbitol,  60  mM  EDTA,  100  mM  sodium  citrate,  pH  7,  0.5  mg/ml 
zymolyase  20T,  100  mM  (3-mercaptoethanol),  and  then  spheroplasts  were  air 
dried  onto  Colorfrost  slides.  Fibers  were  stretched  by  pipetting  50  |jl1  lysing 
solution  (50  mM  Tris-Cl,  pH  7.4,  25  mM  EDTA,  500  mM  NaCl,  0.1%  [wt/vol] 
Nonidet  P-40,  1%  [wt/vol]  sodium  dodecyl  sulfate  [SDS],  3  mM  (3-mercaptoeth- 
anol)  onto  the  spheroplasts,  waiting  1  to  3  min,  and  then  holding  the  slides 
vertically  to  drain  the  liquid.  Chromatin  fibers  were  fixed  in  4%  formaldehyde  in 
PBS  for  20  min,  rinsed  in  PBS,  and  then  heat  fixed  before  being  immunostained 
with  the  indicated  antibodies. 

Slides  were  blocked  in  PBS  containing  5%  calf  serum  and  0.05%  Tween  20  for 
1  h  at  room  temperature.  Primary  antibody  staining  was  carried  out  overnight  at 
4°C  in  a  humid  chamber.  Antibodies  against  phosphorylated  histone  H2AX 
(phospho-H2AX;  Millipore)  and  BrdU  (Becton  Dickinson)  were  used  at  1:50. 
Immunostaining  for  BrdU  was  performed  as  described  previously  (29),  with 
sequential  immunostaining  with  phospho-H2AX  antibodies.  Anti-Rhp51  anti¬ 
bodies  (14)  and  antihemagglutinin  (anti-HA)  antibodies  (16B12;  Covance)  were 
used  at  1:100.  Primary  antibodies  were  detected  by  using  Alexa  Fluor  488  or 
Alexa  Fluor  594  secondary  antibodies  (Invitrogen)  at  1:500  in  PBS  with  5%  calf 
serum.  DNA  was  counterstained  with  4',6-diamidino-2-phenylindole  (DAPI). 

For  staining  by  terminal  deoxynucleotidyltransferase-mediated  dUTP-biotin 
nick  end  labeling,  spread  S.  pombe  chromosomes  were  labeled  with  dUTP- 
digoxigenin  as  described  previously  (4)  and  then  immunostained  with  an  anti- 
digoxigenin  antibody  (Invitrogen)  and  detected  by  incubation  with  an  anti- 
digoxigenin  secondary  antibody  coupled  to  rhodamine  (Invitrogen).  Slides  were 
immunostained  sequentially  with  antibodies  to  detect  terminal  deoxynucleotidyl- 
transferase-mediated  dUTP-biotin  nick  end  labeling  and  phospho-H2AX. 

HeLa  cells  used  for  indirect  immunofluorescence  experiments  were  grown  on 
glass  coverslips.  Cells  on  coverslips  were  permeabilized  with  PBS  that  contained 
0.1%  Triton  X-100  for  7  min  and  then  fixed  with  3.7%  formaldehyde  in  PBS  for 
15  min.  Cells  were  then  incubated  with  blocking  buffer  (PBS  with  5%  bovine 
serum  and  0.5%  Triton  X-100)  for  30  min.  Stretched  chromatin  fibers  were 
prepared  from  HeLa  cells  as  described  previously  (7);  cells  collected  by 
trypsinization  were  resuspended  in  75  mM  KCl  and  pipetted  onto  Colorfrost 
slides.  Slides  were  incubated  in  lysis  buffer  (25  mM  Tris-Cl,  pH  7.5,  0.5  M  NaCl, 
1%  Triton  X-100,  0.5  M  urea)  for  10  min  and  then  lifted  slowly  out  of  the  buffer 
and  fixed  with  3.7%  formaldehyde  in  PBS  for  15  min.  Chromatin  fibers  generated 
by  this  method  are  approximately  2  kb/ixm  (49).  HeLa  cells  and  chromatin  fibers 
were  immunostained  overnight  at  4°C  in  a  humid  chamber  with  antibodies 
against  phospho-H2AX  (Millipore),  MCM7  (Santa  Cruz),  and  Rad51  (Santa 
Cruz),  each  used  at  1:100  dilution;  BrdU  (Becton  Dickinson)  used  at  1:50 
dilution;  or  cleaved  caspase  3  (Cell  Signaling)  used  at  1:500  dilution.  Primary 
antibodies  were  detected  with  Alexa  Fluor  594  or  Alexa  Fluor  488  secondary 
antibodies  used  at  1:500.  DNA  was  counterstained  with  DAPI. 

Images  were  visualized  and  acquired  using  a  Leica  DMR  microscope  and 
Improvision  software  or  with  a  Leica  6000  microscope  and  Leica  FW4000  soft¬ 
ware. 

Flow  cytometry.  S.  pombe  cells  were  fixed  in  70%  ethanol  and  then  treated  with 
RNase  in  50  mM  sodium  citrate.  Cells  were  resuspended  in  Sytox  green  dye 
(Invitrogen)  at  1:2,500  in  50  mM  sodium  citrate  as  described  previously  (2)  for 
both  flow  cytometry  and  examination  of  nuclear  morphology. 

HeLa  cells  were  collected  by  trypsinization  and  then  fixed  with  70%  ethanol 
for  30  min.  Cells  were  then  washed  in  PBS,  treated  with  RNase  in  PBS,  and  then 
resuspended  in  PBS  with  4  |JLg/ml  propidium  iodide. 

Cells  were  analyzed  on  a  FACScan  cytometer  (Becton  Dickinson)  by  using 
CellQuest  software. 

Viability  assays.  S.  pombe  cells  were  grown  asynchronously  at  the  permissive 
temperature  for  16  h,  and  then  HU  was  added  for  4  h.  For  each  condition,  5,000 
cells  were  counted  and  plated  for  absolute  viability.  The  remaining  cells  were 
then  washed  and  transferred  to  the  same  original  volume  of  medium,  either  with 
or  without  HU,  at  the  permissive  or  restrictive  temperature.  For  each  condition. 


5,000  cells  were  plated  to  assess  relative  viability.  Colonies  were  counted  after  4 
days  of  growth  on  yeast  extract  plus  supplements  (YES)  solid  medium  at  the 
permissive  temperature.  Cellular  DNA  content  was  analyzed  by  flow  cytometry 
as  described  above. 

Immunoprecipitation  and  immunoblotting.  We  prepared  soluble  protein  ly¬ 
sates  from  fission  yeast  cells  by  vortexing  cells  with  glass  beads  in  B88  lysis  buffer 
(20  mM  HEPES  pH  7.0,  50  mM  potassium  acetate,  5  mM  magnesium  acetate, 
100  mM  sorbitol,  0.1%  Triton  X-100)  to  which  2  mM  dithiothreitol,  protease 
inhibitor  cocktail  (Sigma),  50  mM  sodium  orthovanadate,  and  50  mM  (3-glycero¬ 
phosphate  were  added  (2).  A  total  of  0.5  to  1.0  mg  protein  was  used  per 
immunoprecipitation.  Proteins  were  analyzed  by  10%  SDS-polyacrylamide  gel 
electrophoresis  (PAGE),  followed  by  immunoblotting  with  anti-HA  (16B12; 
Covance),  anti-myc  (9el0),  antitubulin  (4A1)  (31),  and  anti-Rhp51  antibodies 
(14),  each  at  a  1:1,000  dilution.  Antibodies  were  covalently  coupled  to  protein  A 
beads  as  described  previously  (2). 

To  assay  the  depletion  of  MCM  protein  and  the  coprecipitation  of  MCM 
proteins  with  Rad51,  HeLa  cells  were  lysed  in  modified  CSK  buffer  [10  mM 
piperazine-A7V'-bis(2-ethanesulfonic  acid),  pH  6.8,  100  mM  NaCl,  300  mM 
sucrose,  1  mM  MgCl2,  1  mM  EGTA,  1  mM  dithiothreitol,  1  mM  phenylmeth- 
ylsulfonyl  fluoride,  10  U/ml  aprotinin,  50  |jlM  NaF,  50  |jlM  (3-glycerophosphate, 
0.5%  Triton  X-100]  for  20  min  at  4°C.  Lysates  were  centrifuged  at  a  low  speed 
for  5  min  and  then  analyzed  by  10%  SDS-PAGE  and  immunoblotting  with 
anti-MCM4  (Becton  Dickinson),  anti-MCM7  (Santa  Cruz),  anti-PCNA  (Santa 
Cruz),  or  anti-Rad51  (Santa  Cruz)  antibodies  (all  used  at  1:1,000). 

Mcm4  mobility  shift.  Whole-cell  protein  extract  was  prepared  with  modifica¬ 
tions  from  Foiani  et  al.  (21a).  Cells  were  disrupted  with  glass  beads  by  vortexing 
in  10%  trichloroacetic  acid,  washed  with  acetone,  and  resuspended  in  Laemmli 
buffer.  A  total  of  150  |jLg  of  protein  was  analyzed  by  8%  SDS-PAGE,  followed  by 
immunoblotting  with  anti-MCM4  antibodies  (66).  For  phosphatase  treatment, 
soluble  protein  lysates  (described  above)  were  incubated  for  20  min  at  30°C  with 
lambda  protein  phosphatase  (New  England  Biolabs).  Reaction  products  were 
analyzed  by  8%  SDS-PAGE,  followed  by  immunoblotting  with  anti-MCM4  (66) 
and  anti-HA  antibodies  (16B12;  Covance). 

RESULTS 

mcm-ts  mutants  cause  irreversible  DNA  damage.  Although 
MCM  activity  is  crucial  throughout  S  phase,  most  mcm-ts  mu¬ 
tants  appear  to  complete  bulk  DNA  replication  (18,  23,  40). 
However,  these  mcm-ts  mutant  cells  activate  a  Chkl -depen¬ 
dent  checkpoint  and  arrest  irreversibly  in  late  S  phase  or  G2 
(41,  47),  raising  the  possibility  that  the  cells  accumulate  DNA 
damage  during  replication  termination  or  in  specific  chromo¬ 
somal  regions.  To  test  this  idea,  we  prepared  spread  nuclei 
from  wild-type  and  mcm-ts  mutant  cells  grown  at  permissive 
and  restrictive  temperatures  and  immunostained  for  phosphory¬ 
lated  histone  H2A  (phospho-H2A),  a  marker  for  DNA  dam¬ 
age.  In  mammalian  cells,  the  histone  H2A  variant  H2AX  is 
rapidly  and  specifically  phosphorylated  in  response  to  DNA 
double-stranded  breaks  (DSBs)  and  this  signal  spreads  to  re¬ 
gions  of  flanking  chromatin  megabases  away  from  the  break  sites 
(64).  A  phosphorylation-specific  H2AX  antibody  (phospho- 
H2AX)  recognizes  a  phospho-SQE  amino  acid  sequence  in  the 
carboxyl  terminus  of  the  H2AX  protein  that  is  highly  conserved  in 
other  species  (64,  69),  including  the  S.  pombe  histone  H2A  ho- 
mologues  Htal  and  Hta2  (54).  Phospho-H2A  foci  in  yeast  form  in 
response  to  chemical  agents  that  cause  DNA  DSBs,  localize  to 
DNA  break  sites,  and  are  dependent  on  checkpoint  protein  ki¬ 
nases,  indicating  that  they  also  mark  DSBs  (54,  83)  (see  Fig.  SIA, 
SIB,  and  SIC  in  the  supplemental  material). 

Wild-type  cells  and  mcm4ts  cells  grown  asynchronously  at 
the  permissive  temperature  contain  very  little  DSB  damage:  we 
detected  phospho-H2A  foci  in  only  a  small  fraction  of  spread 
nuclei,  and  there  were  <5  foci  per  nucleus  (Fig.  lA  and  data 
not  shown).  In  contrast,  we  observed  extensive  DNA  damage 
in  elongated,  arrested  mcm4ts  cells  that  had  been  held  at  the 
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wild  type  mcm4ts 


wild  type  _ mcm4ts 


FIG.  1.  mcm-ts  mutants  generate  DNA  damage  during  replication.  (A)  mcm4ts  cells  generate  DNA  damage.  Spread  nuclei  prepared  from 
wild-type  (FY72)  and  mcm4ts  (FY364)  cells  shifted  to  the  restrictive  temperature  (36°C)  for  4  h  were  immunostained  for  phospho-H2A  (green) 
to  mark  DNA  breaks  and  counterstained  with  DAPI  (blue)  to  detect  the  DNA.  a-pH2A,  anti-phospho-H2A  antibody.  Scale  bar,  10  mM.  Two 
examples  of  wild-type  nuclei  are  shown;  while  most  nuclei  did  not  contain  phospho-H2A  foci,  a  small  number  of  S-phase  nuclei  displayed  a  few 
phospho-H2A  foci.  (B  and  C)  DNA  damage  in  mcm4ts  cells  requires  S-phase  progression.  Wild-type  (FY72),  mcm4ts  (FY364),  and  mcm4ts-degron 
(FY2887)  cells  grown  at  25°C  were  blocked  in  G^  by  nitrogen  starvation  and  then  released  into  S  phase  at  36°C.  At  1-h  intervals,  (B)  DNA  content 
was  measured  by  flow  cytometry,  and  (C)  DNA  damage  was  examined  by  immunostaining  spread  nuclei  for  phospho-H2A.  (D)  DNA  damage  in 
mcm4ts  cells  corresponds  to  sites  of  new  synthesis.  Wild-type  (FY2317)  and  mcm4ts  (FY2514)  cells  were  shifted  to  the  restrictive  temperature  for 
3.5  h  and  then  pulse-labeled  with  BrdU  for  30  min.  Stretched  chromatin  flbers  were  then  prepared  from  these  cells  and  visualized  with  DAPI  (left). 
Chromatin  flbers  were  immunostained  with  antibodies  to  BrdU  and  phospho-H2A  to  detect  sites  of  replicating  DNA  and  DNA  breaks,  respectively 
(right).  Scale  bar,  10  ijum  (approximately  18  kb). 


restrictive  temperature  for  4  h,  with  more  than  80%  of  the 
spread  nuclei  containing  >20  phospho-H2A  foci  per  nucleus 
(Fig.  lA).  In  addition,  the  mcm4ts  mutant  cells  also  displayed 
nuclear  foci  of  the  DNA  repair  proteins  Rhp51  and  Rad22  (the 
fission  yeast  homologues  of  RadSl  and  Rad52)  (see  Fig.  SID 
and  SIE  in  the  supplemental  material;  data  not  shown),  which 
have  previously  been  used  as  molecular  markers  for  DNA 
damage  in  fission  yeast  (20,  48),  supporting  the  conclusion  that 
DNA  breaks  occur  in  these  cells. 

We  also  examined  other  mutants  affecting  the  MCM  com¬ 
plex,  and  additional  replication  mutants,  for  the  formation  of 
DNA  damage  at  the  restrictive  temperature.  All  mutants  an¬ 
alyzed  contained  Rhp51  foci,  which  mark  single-stranded 
breaks  and  DSBs  (13,  60,  76).  Mutations  affecting  subunits  of 
DNA  polymerase  8  {cdc6  and  cdc27)  and  the  cdc22  ribonucle¬ 
otide  reductase  mutant,  all  of  which  retain  viability  upon  a  shift 
to  high  temperature,  displayed  few  phospho-H2A  foci.  In  con¬ 


trast,  mcm-ts  mutants  and  the  DNA  ligase  cdcll  mutant  all 
contained  extensive  phospho-H2A  foci,  indicating  DNA  DSBs, 
and  suffered  severe  losses  of  viability.  Thus,  phospho-H2A  foci 
correlate  with  inviability  (see  Fig.  SID  in  the  supplemental 
material;  Table  2).  Together,  these  data  suggest  that  stability  of 
the  MCMs  in  the  replisome  is  important  for  cell  viability  fol¬ 
lowing  replication  arrest. 

DNA  damage  in  mcm-ts  cells  requires  replication  progres¬ 
sion.  While  mcm-ts  alleles  complete  bulk  DNA  synthesis  (18, 
23,  40,  41),  a  degron  allele  of  mcm4ts  blocks  initiation,  result¬ 
ing  in  limited  or  no  DNA  synthesis  (42).  To  determine  whether 
the  lethal  DNA  damage  in  mcm-ts  cells  occurred  as  a  result  of 
defects  in  initiation  or  replication  elongation,  we  compared 
phospho-H2A  focus  formation  in  spread  nuclei  from  wild-type, 
mcm4ts,  and  mcm4ts-degron  cells  following  release  from  G^/S 
arrest  into  S  phase  at  the  restrictive  temperature.  We  observed 
a  dramatic  accumulation  of  phospho-H2A  foci  in  mcm4ts  cells 
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TABLE  2.  DNA  damage  in  fission  yeast  replication  mutants'* 


Characteristic  or 
mutation 

Strain 

Wild-type  gene  function 

Arrest  point 

Relative 
viability  (%) 

%  Spread  nuclei  with  foci  at  36°C 

Phospho- 

H2A 

Rhp51 

Phospho-H2A 
and  Rhp51 

Wild  type 

FY72 

4 

4 

4 

mcm2  (cdcl9) 

FY243 

MCM  complex: 

Late  S 

16.3 

84 

84 

84 

replicative  helicase 

mcm4  (cdc21) 

FY364 

Late  S 

29.5 

94 

94 

94 

mcm6  (mis5) 

FY961 

Late  S 

18.8 

75 

75 

75 

mcm? 

FY1201 

Late  S 

15.2 

85 

85 

85 

cdc6 

FY641 

DNA  polymerase 

Late  S 

111.1 

17 

38 

17 

cdcl7 

FY1313 

DNA  ligase 

Late  S 

29.9 

90 

90 

90 

cdc22 

FY522 

Ribonucleotide  reductase 

Early  S 

102.3 

17 

55 

17 

cdc27 

FY969 

DNA  polymerase 

Late  S 

131.3 

10 

46 

10 

"  Yeast  replication  mutants  that  arrest  irreversibly  at  the  restrictive  temperature  generate  phospho-H2A  foci.  Cultures  were  grown  to  early  log  phase  at  25°C  and  then  shifted 
to  36°C  for  4  h.  Arrest  points  were  determined  by  flow  cytometry  and  analysis  of  cell  morphology  (data  not  shown).  At  0  and  4  h  after  shift  to  36°C,  aliquots  of  cells  were  plated 
and  incubated  at  the  permissive  temperature.  Relative  viabiUty  was  determined  as  the  number  of  viable  colonies  from  the  4-h  time  point  compared  to  the  number  at  the  0-h 
time  point.  Spread  nuclei  prepared  from  arrested  cells  were  immunostained  for  phospho-H2A  and  Rhp51.  At  least  300  spread  nuclei  per  strain  were  examined. 


as  they  progressed  through  S  phase,  and  these  persisted  in  cells 
that  arrested  (Fig.  IB  and  C).  In  contrast,  the  mcm4ts-degron 
cells  displayed  few  phospho-H2A  foci,  and  the  inefficient  DNA 
synthesis  coupled  with  cell  elongation  resulted  in  apparent 
delayed  and  incomplete  S-phase  progression  by  flow  cytom¬ 
etry.  This  observation  suggests  that  the  DNA  damage  in 
mcm4ts  cells  occurs  during  replication  elongation,  which  would 
be  consistent  with  replication  fork  collapse.  In  agreement  with 
this  suggestion,  the  phospho-H2A  foci  observed  in  mcm4ts 
cells  were  suppressed  by  the  addition  of  HU  (<5%  of  cells  with 
foci  [data  not  shown]),  which  delays  cells  in  early  S  phase  with 
replication  initiated  from  only  a  subset  of  origins  and  active 
forks  stabilized  by  the  checkpoint.  In  comparison,  a  low  per¬ 
centage  of  wild-type  cells  exhibited  a  few  phospho-H2A  foci 
during  S  phase,  but  these  disappeared  as  the  cells  completed  S 
phase  (Fig.  1C),  suggesting  that  low  levels  of  spontaneous 
DNA  damage  or  fork  collapse  may  occur  during  replication 
but  that  such  damage  is  normally  recognized  and  repaired. 

Genome-wide  replication  fork  breakage  can  be  monitored 
using  pulsed-field  gel  electrophoresis  gels  that  detect  frag¬ 
mented  DNA  (for  an  example,  see  reference  51).  However,  we 
found  that  the  majority  of  chromosomes  prepared  from 
mcm-ts  cells  at  the  restrictive  temperature  did  not  enter 
pulsed-field  gel  electrophoresis  gels  (41;  data  not  shown),  sug¬ 
gesting  that  they  contain  topologically  complex  chromosomal 
structures,  such  as  replication  bubbles  or  other  S-phase  inter¬ 
mediates  that  impair  electrophoretic  mobility.  The  same  result 
has  also  been  observed  for  other  replication  mutants  that  ar¬ 
rest  S-phase  progression  (for  an  example,  see  reference  81). 
Although  replication  fork  breakage  at  specific  sites  can  be 
monitored  by  two-dimensional  gel  electrophoresis,  the  DNA 
breaks  in  mcm-ts  cells  occur  late  in  S  phase  and  are  unlikely  to 
be  associated  with  replication  origins  or  other  defined  se¬ 
quences.  Therefore,  as  an  alternative  method  to  examine  DNA 
breaks  associated  with  replicating  DNA  in  mcm4ts  mutants,  we 
pulse-labeled  cells  with  BrdU  and  then  prepared  stretched 
chromatin  fibers  which  we  costained  for  BrdU,  phospho-H2A, 
and  DAPI  (Fig.  ID).  The  BrdU  incorporation  pattern  on  fibers 
from  both  wild-type  and  mcm4ts  cells  was  discontinuous  and 
punctate,  suggesting  that  DNA  synthesis  had  initiated  from 
multiple,  discrete  origins  (Fig.  ID).  For  chromatin  fibers  pre¬ 


pared  from  wild-type  cells  pulse-labeled  with  BrdU  (Fig.  ID), 
just  9%  of  BrdU-labeled  fibers  displayed  phospho-H2A  stain¬ 
ing,  indicating  that  the  chromatin-stretching  procedure  itself 
and  the  short  incubation  with  BrdU  did  not  cause  DNA  DSBs. 
In  contrast,  63%  of  BrdU-stained  chromatin  fibers  prepared 
from  mcm4ts  cells  also  exhibited  extensive  phospho-H2A  stain¬ 
ing  and  these  damage  foci  overlapped  with  regions  of  BrdU 
incorporation  (Fig.  ID).  These  results  suggest  that  mcm-ts- 
induced  breaks  are  associated  with  regions  of  newly  synthe¬ 
sized  DNA,  consistent  with  the  model  that  they  result  from 
replication  fork  collapse.  Importantly,  these  fibers  are  chroma- 
tinized  and  fully  proteinated;  the  DNA  break  sites  may  be  held 
together  by  protein-protein  interactions,  such  as  those  sug¬ 
gested  for  the  recombination  complex  Mrell-Rad50-Nbsl 
(45),  or  may  represent  DSBs  in  the  process  of  repair. 

MCM  proteins  are  required  for  replication  restart  after 
arrest.  These  observations  suggest  that  one  function  of  the 
MCM  complex  is  to  protect  replication  fork  structure  during 
normal  S  phase.  Because  MCM  subunits  can  be  phosphory- 
lated  by  checkpoint  kinases  (17,  33,  67,  82),  we  considered  the 
possibility  that  the  MCM  complex  also  contributes  to  fork 
stability  during  replication  arrest.  To  test  this  idea,  we  treated 
wild-type,  mcm4ts,  or  mcm4ts-degron  mutant  cells  with  HU, 
which  stalls  replication  forks  and  activates  the  replication 
checkpoint.  We  next  inactivated  the  Mcm4  subunit  by  shifting 
to  the  restrictive  temperature  while  maintaining  HU  arrest. 
We  then  asked  whether  this  treatment  generated  DNA  DSBs 
and  whether  the  cells  could  recover  from  the  arrest  (Fig.  2A). 
Wild-type  cells  with  normal  Mcm4  function  arrested  in  HU 
with  a  primarily  1C  DNA  content,  and  the  cells  completed  S 
phase  and  resumed  cycling  upon  release  from  HU  (Fig.  2B  and 
D).  Few  phospho-H2A  foci  were  observed  at  either  the  per¬ 
missive  or  the  restrictive  temperature,  consistent  with  our  pre¬ 
vious  observations  that  suggest  limited,  spontaneous  DNA 
damage  can  occur  as  a  normal  part  of  S-phase  progression 
(Fig.  2C;  see  Fig.  SI  in  the  supplemental  material).  In  a  man¬ 
ner  similar  to  that  of  wild-type  cells,  mcm4ts-degron  cells  also 
arrested  in  HU,  with  few  phospho-H2A  foci  (Fig.  2B  and  C). 
However,  upon  a  shift  to  the  restrictive  temperature,  we  ob¬ 
served  that  mcm4ts-degron  cells  generated  numerous  phospho- 
H2A  foci,  even  though  the  cells  were  maintained  in  HU  and 
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FIG.  2.  MCM  proteins  are  required  for  recovery  from  HU.  (A)  Schematic  of  the  experimental  protocol.  Wild-type  (FY72),  mcm4ts  (FY364), 
and  mcm4ts-degron  (FY2887)  cells  were  grown  asynchronously  (white  arrows)  and  then  arrested  in  HU  for  4  h  (black  arrows).  Cells  were  shifted 
to  the  restrictive  temperature  of  36°C  and  then  released  from  HU.  Samples  were  taken  at  time  points  tl,  t2,  t3,  and  t4.  (B)  Cellular  DNA  content 
was  analyzed  by  flow  cytometry.  All  strains  contained  approximately  1C  DNA  content  in  HU.  The  wild-type  and  mcm4ts  cells,  but  not  the 
mcm4ts-degron  cells,  completed  bulk  synthesis  upon  release  from  HU.  (C)  Inactivation  of  Mcm4  resulted  in  DNA  breaks  in  HU.  Wild-type  and 
mcm4ts-degron  mutants  were  arrested  in  HU  at  25°C  (t2)  and  shifted  to  36°C  while  maintained  in  HU  (t3).  DNA  DSBs  were  detected  by 
phospho-H2A  immunostaining  as  described  in  the  legend  to  Fig.  1.  (D)  Cellular  morphology  was  examined  by  staining  with  Sytox  green  to  detect 
the  DNA.  Scale  bars,  10  ijum. 


remained  arrested  (Fig.  2B  and  C).  This  result  suggests  that 
the  loss  of  Mcm4  function  after  replication  forks  have  formed 
and  stalled  causes  extensive  DNA  damage.  Upon  release  from 
HU  at  the  restrictive  temperature,  the  mcm4ts-degron  mutants 
showed  little  additional  DNA  accumulation  and  no  sign  of 
division,  consistent  with  the  essential  role  for  the  MCM  com¬ 
plex  throughout  DNA  replication  (36). 

Unexpectedly,  we  found  that  mcm4ts  cells  blocked  in  HU 
and  then  shifted  to  a  high  temperature  could  recover  from 
arrest,  enter  mitosis,  and  remain  viable,  with  an  increase  in  cell 
number  similar  to  that  of  the  wild  type  (178.7%  ±  11.3% 
[mean  ±  standard  deviation]  relative  viability  for  mcm4ts  com¬ 
pared  to  178.0%  ±  6.4%  for  the  wild  type)  (Fig.  2A).  This 
result  was  surprising  because  mcm4ts  mutants  shifted  asyn¬ 
chronously  to  the  restrictive  temperature,  or  released  from 
nitrogen  starvation  to  the  restrictive  temperature,  accumulate 
phospho-H2A  foci  and  lose  viability  (Fig.  1).  The  mcm4ts  mu¬ 
tant  contains  a  single  amino  acid  substitution  (Leu238  to  Pro), 
a  relatively  modest  lesion  that  may  cause  loss  of  function  at  a 
high  temperature  through  a  conformational  change  in  the  pro¬ 
tein  (42).  We  hypothesized  that  activation  of  the  replication 
checkpoint  during  HU  arrest  might  rescue  the  defect  associ¬ 
ated  with  mcm4ts  by  stabilizing  the  Mcm4ts  mutant  protein. 

Mcm4  is  a  target  of  the  replication  checkpoint.  We  there¬ 
fore  tested  whether  Mcm4  interacts  with  the  Cdsl  checkpoint 
kinase  and  whether  Mcm4  is  a  target  of  Cdsl -dependent  phos¬ 


phorylation.  We  found  that  Mcm4  and  Cdsl  coimmunopre- 
cipitate  in  the  presence  of  HU  (Fig.  3A).  We  also  observed  a 
mobility  shift  of  the  Mcm4  protein  in  extracts  from  HU-treated 
cells  (Fig.  3B  to  E).  This  shifted  band  could  be  collapsed  by 
phosphatase  treatment,  suggesting  that  it  represents  the  phos¬ 
phorylation  of  Mcm4  (see  Fig.  S2A  in  the  supplemental  ma¬ 
terial).  Identical  results  were  observed  using  a  ^cdsl  strain 
with  untagged  Mcm4  (FY865)  (data  not  shown).  Although  we 
observed  the  Mcm4  mobility  shift  in  both  wild-type  and 
mcm4ts  strains,  it  did  not  occur  in  mcm4ts-degron  strains,  even 
under  permissive  conditions.  Interestingly,  the  mcm4ts-degron 
mutant  lost  viability  in  HU,  prior  to  any  temperature  shift, 
suggesting  that  it  has  a  defect  in  the  replication  checkpoint 
response  even  at  the  permissive  temperature  (see  Fig.  S2B, 
S2C,  and  S2D  in  the  supplemental  material). 

Consistent  with  the  possibility  that  Mcm4  is  a  target  of  the 
replication  checkpoint,  we  found  that  the  Mcm4  mobility  shift 
depends  upon  Cdsl  (Fig.  3E)  as  well  as  on  the  Hskl  (ScCdc7) 
kinase  (see  Fig.  S2B,  S2C,  and  S2D  in  the  supplemental  ma¬ 
terial).  Hskl  is  required  for  full  Cdsl  activation  (75)  and  is 
itself  a  potential  Cdsl  target  (72).  Simple  overexpression  of 
Cdsl  was  unable  to  rescue  mcm4ts  inviability  (data  not  shown), 
indicating  that  activation  of  the  checkpoint  is  required.  These 
data  suggest  that  Mcm4  undergoes  Cdsl -dependent  phosphory¬ 
lation  that  may  contribute  to  replication  checkpoint  function 
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FIG.  3.  Cdsl-dependent  phosphorylation  of  Mcm4.  (A)  Mcm4  coimmunoprecipitates  Cdsl.  Protein  lysates  prepared  from  wild-type  (FY72), 
mcm4-HA  (FY1167),  mcm4-HA  Arhp51  (FY2356),  cdsl-myc  (JBY267),  and  mcm4-HA  cdsl-myc  (JBY235)  strains  incubated  with  15  mM  HU  for 
4  h  were  immunoprecipitated  (IP)  with  anti-HA  antibodies  or  control  (C)  antibody  (antitubulin);  immunoprecipitates  were  then  immunoblotted 
(IB)  to  detect  Cdsl-myc  and  Mcm4-HA.  (B).  Schematic  of  the  experimental  protocol.  Cells  were  grown  asynchronously  (white  arrow)  and  arrested 
in  HU  for  4  h  (black  arrow).  Cells  were  then  released  into  medium  with  or  without  HU,  either  at  25°C  or  36°C.  Samples  were  taken  at  time  points 
tl,  t2,  t3,  t4,  t5,  and  t6.  (C)  Cellular  DNA  content  was  analyzed  in  wild-type  (FY7),  mcm4ts  (FY1413),  Acdsl  (FY2423),  Acdsl  mcm4ts  (FY1617), 
and  mcm4ts-degron  (FY3396)  cells  by  flow  cytometry.  (D)  Relative  viability  of  the  strains  in  panel  C  was  determined  as  described  in  Materials  and 
Methods.  (E)  Mcm4  gel  mobility  shift  in  HU.  Trichloroacetic  acid-precipitated  lysates  from  the  strains  indicated  in  panel  C  were  immunoblotted 
(IB)  for  Mcm4  protein  with  anti-Mcm4  antibody  (a-Mcm4).  The  shifted  (phosphorylated  [P])  form  of  Mcm4  is  indicated  by  the  white  arrowhead. 
The  Mcm4-degron  protein  exhibits  reduced  mobility  relative  to  wild-type  Mcm4  because  of  the  large  N-terminal  degron  tag.  Strain  FY2423 
contains  Mcm4HA,  but  identical  results  were  observed  using  FY866,  which  has  untagged  Mcm4  (data  not  shown). 


and  further  imply  that  the  Mcm4ts  protein  is  stabilized  through 
Cdsl-dependent  modification  in  HU. 

MCM  proteins  link  checkpoint  activation  to  recovery.  In 
prokaryotes,  it  is  well  established  that  replication  forks  that 
arrest  or  break  can  restart  and  resume  synthesis  through  re¬ 
combination  (reviewed  in  references  27  and  50).  However, 


evidence  for  a  similar  pathway  in  eukaryotic  cells  remains 
controversial.  We  observed  that  mcm-ts  Arhp51  and  mcm-ts 
^rad22  double  mutants  were  unable  to  grow  at  intermediate 
temperatures  that  were  permissive  for  either  single  mutant 
(Table  3).  This  observation  suggests  that  HR  is  required  to 
repair  the  fork-dependent  DNA  breaks  in  mcm-ts  cells  or  that 
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TABLE  3.  Genetic  interactions  between  mutants  of  the  MCM 
complex  and  DNA  repair  mutants"* 


Strain 

Genotype 

Phenotype 

FY1507 

mcm4ts  Arhp51 

Slow/reduced  growth  at  29°C,  32°C, 
and  34°C 

FY1375 

mcm4ts  ^rad22 

Slow/reduced  growth  at  29°C,  32°C, 
and  34°C 

FY3203 

mcm2ts  Arhp51 

Slow/reduced  growth  at  29°C,  32°C, 
and  34°C 

FY1374 

mcm2ts  ^rad22 

Slow/reduced  growth  at  29°C,  32°C, 
and  34°C 

FY3204 

mcm4ts  ^md32 

No  genetic  interaction 

FY3201 

mcm2ts  t^md32 

No  genetic  interaction 

"HR  mutants  decrease  the  ability  of  mcm4ts  to  grow  at  semipermissive  tem¬ 
peratures.  Double  mutant  strains  were  compared  against  the  single  mutants  for 
growth  on  solid  rich  medium  (YES)  incubated  at  25°C,  29°C,  32°C,  and  34°C  for 
4  days. 


HR  proteins  cooperate  with  the  replication  machinery  to  pro¬ 
tect  replication  fork  structure.  Given  that  both  MCM  proteins 
and  HR  proteins  are  predicted  to  act  at  or  near  replication 
forks,  we  tested  for  physical  association  of  these  proteins.  In- 
triguingly,  we  found  that  RhpSl  (ScRadSl)  coimmunoprecipi- 
tated  with  Mcm4  from  cells  treated  with  HU  when  both  pro¬ 
teins  were  expressed  at  endogenous  levels  (Fig.  4A;  see  Fig.  S3 
in  the  supplemental  material).  The  interaction  between  RhpSl 
and  Mcm4  was  still  observed  when  lysates  were  pretreated  with 
DNase  (data  not  shown),  indicating  that  it  was  not  an  indirect 
effect  of  the  association  of  both  proteins  with  the  same  DNA 
fragments.  In  contrast,  little  or  no  RhpSl  bound  to  Mcm4  in 
asynchronous  cultures  (Fig.  4A)  or  in  cells  released  synchro¬ 
nously  from  a  block  into  S  phase  in  the  absence  of  HU  (see 
Fig.  S3  in  the  supplemental  material).  It  remains  a  possibility 
that  RhpSl  and  Mcm4  do  associate  during  normal  S  phase  in 
the  subset  of  cells  that  generate  RhpSl  or  phospho-H2A  foci, 
which  may  reflect  sites  of  spontaneous  DNA  damage.  We  also 
observed  that  Mcm4  coimmunoprecipitates  overexpressed 
Rad22-yeflow  fluorescent  protein  (YEP)  from  cells  treated 
with  HU  (data  not  shown),  suggesting  that  the  MCM  complex 
may  interact  with  repair  complexes.  These  interactions  could 
link  fork  protection  and  replication  restart  with  recovery  from 
arrest.  Therefore,  we  examined  the  dynamics  of  recovery  from 
replication  arrest  in  recombination  mutants  in  more  detail. 

When  cells  are  treated  with  HU,  replication  forks  stall,  lead¬ 
ing  to  Cdsl  activation  and  replication  arrest.  In  the  ^cdsl 
mutant,  this  checkpoint  cannot  be  activated  to  stabilize  fork 
structure  and  the  forks  collapse,  leading  to  rapid  loss  of  via¬ 
bility.  This  is  also  the  case  in  Arat/5  mutants,  because  the  Rad3 
kinase  is  required  for  Cdsl  phosphorylation  and  activation.  In 
contrast,  while  the  ^rhp51  and  b^md22  recombination  mutants 
are  sensitive  to  growth  on  solid  medium  that  contains  HU,  they 
display  only  a  modest  loss  of  viability  during  continuous  HU 
exposure  (48,  84)  (Fig.  4B  and  data  not  shown).  The  viability 
loss  in  b^rhpSl  and  I^md22  cells  occurs  after  3  to  4  h  in  HU, 
which  corresponds  to  the  time  point  when  wild-type  cells  begin 
to  adapt  to  the  drug  and  resume  cell  cycle  progression  (Fig. 
4B).  Because  the  loss  of  viability  in  HR  mutants  is  modest 
compared  to  that  in  checkpoint  mutants  and  occurs  at  a  later 
time  point,  we  infer  that  HR  proteins  have  a  distinct,  nones¬ 
sential  role  in  recovery  from  HU.  We  next  assessed  phospho- 


H2A  focus  formation  in  checkpoint  and  recombination  mutant 
cells  that  were  blocked  in  HU  and  then  released.  Nearly  all  of 
the  ^cdsl  and  b^mdd  checkpoint  mutant  cells  exposed  to  HU 
contained  substantial  numbers  of  phospho-H2A  foci  (Fig.  4C), 
consistent  with  previous  evidence  that  replication  forks  col¬ 
lapse  in  the  absence  of  replication  checkpoint  activation  (48, 
56,  73).  In  the  recombination  mutant  cells,  however,  a  much 
lower  percentage  of  spread  nuclei  displayed  phospho-H2A  foci 
and  there  were  fewer  foci  per  nucleus  (<5  foci/nucleus)  than  in 
the  checkpoint  mutant  cells  (>20  foci/nucleus)  (Fig.  4C). 
These  results  suggest  that  replication  forks  do  not  collapse  in 
HR  mutants,  leading  us  to  conclude  that  HR  proteins  are  not 
required  for  replication  fork  protection  during  HU-induced 
arrest. 

We  also  examined  recruitment  of  the  HR  proteins  them¬ 
selves  by  using  a  Rad22-YFP  fluorescent  fusion  protein  de¬ 
scribed  previously  (20,  48)  as  well  as  by  using  indirect  immu¬ 
nofluorescence  to  detect  Rad22  and  RhpSl  (Fig.  4D  and  E).  In 
asynchronous  cultures,  a  low  frequency  of  wild-type  cells  dis¬ 
played  nuclear  foci  of  RhpSl  and  Rad22  (Fig.  4A  and  data  not 
shown),  a  result  similar  to  what  we  observed  for  phospho-H2A; 
these  foci  may  represent  sites  of  spontaneous  DNA  damage  or 
stalled  replication  forks  in  S-phase  cells  (37).  However,  the 
percentage  of  cells  with  Rad22  or  RhpSl  foci  was  increased  in 
cells  arrested  in  HU  and  released.  In  asynchronous  cultures, 
8.3%  ±  0.57%  of  cells  displayed  one  Rad22-YFP  focus,  while 
fewer  than  1%  of  cells  exhibited  more  than  one  Rad22-YFP 
focus  (Fig.  4D).  After  2  h  in  HU,  24%  ±  2.0%  of  cells  dis¬ 
played  at  least  one  Rad22-YFP  focus.  By  4  h  in  HU,  when 
nearly  all  of  the  cells  were  elongated  and  wild-type  cells  were 
starting  to  adapt  and  reenter  the  cell  cycle,  82.0%  ±  3.2%  of 
cells  exhibited  at  least  one  Rad22-YFP  focus  (Fig.  4D).  Nu¬ 
clear  foci  of  RhpSl  were  detected  in  cells  treated  with  HU  with 
timing  similar  to  that  for  Rad22-YFP  (Fig.  4E).  While  just 
6.0%  ±  2.8%  of  spread  nuclei  prepared  from  cells  from  asyn¬ 
chronous  cultures  displayed  RhpSl  foci,  18.5%  ±  2.1%  of 
spread  nuclei  prepared  from  cells  incubated  with  HU  for  2  h 
and  82.0%  ±  2.8%  of  spread  nuclei  prepared  from  cells  incu¬ 
bated  with  HU  for  4  h  contained  multiple  RhpSl  foci  (Fig.  4C). 
Consistent  with  results  of  previous  reports  (44,  48),  we  found 
that  the  localization  of  recombination  proteins  to  nuclear  foci 
did  not  occur  immediately  upon  treatment  with  HU  but  oc¬ 
curred  during  the  recovery  stage,  consistent  with  the  timing  of 
viability  loss  in  HR  mutants  (Fig.  4B). 

To  investigate  how  recombination  contributes  to  survival  of 
replication  arrest,  we  blocked  cells  in  HU  for  4  h,  released  the 
cells  into  fresh  medium  without  HU,  and  assayed  DNA  con¬ 
tent,  cell  morphology,  viability,  and  DNA  damage  during  ar¬ 
rest  and  recovery.  Wild-type  cells  elongated  and  arrested  with 
a  ~1C  DNA  content  on  treatment  with  HU  and  completed 
bulk  DNA  replication  and  retained  normal  cell  morphology 
and  viability  when  released  from  HU  (Fig.  5A).  Similar  results 
were  observed  for  mcmdts  cells  (Fig.  3C  and  data  not  shown) 
as  described  above.  In  contrast,  /Srad3  checkpoint  mutant  cells 
were  unable  to  restrain  cell  division  in  the  presence  of  HU, 
resulting  in  immediate  chromosome  missegregation  and  cell 
lethality;  IHi^cdsl  mutant  cells  initially  arrested  due  to  activation 
of  the  Chkl -dependent  checkpoint,  but  then  they  also  divided 
in  the  presence  of  DNA  damage  (8,  48)  (Fig.  5).  Significantly, 
we  observed  that  78%  and  77%  of  the  missegregation  events  in 


Downloaded  from  mcb.asm.org  by  on  February  14,  2008 


1732  BAILIS  ET  AL. 


Mol.  Cell.  Biol. 


FIG.  4.  Recombination  proteins  in  the  HU  response  (A)  Mcm4  coprecipitates  Rhp51  in  HU.  Protein  lysates  prepared  from  wild-type  (FY72), 
mcm4-HA  Arhp51  (FY2356),  and  mcm4-HA  (FY1167)  cells  untreated  or  treated  with  15  mM  HU  for  4  h  (+HU)  were  immunoprecipitated  (IP) 
with  anti-HA  antibodies,  an  irrelevant  control  antibody  (lane  C),  or  protein  A  beads  only  (lane  B).  Immunoprecipitates  and  whole-cell  lysates  were 
analyzed  by  10%  SDS-PAGE,  followed  by  immunoblotting  (IB)  with  antibodies  to  detect  Mcm4-HA  (anti-HA)  and  Rhp51  (anti-Rhp51).  (B)  HR 
mutants  display  a  modest  loss  of  viability  in  HU  compared  to  checkpoint  mutants.  Wild-type  (FY72),  mcm4ts  (FY364),  ^cdsl  (FY865),  ^rad3 
(FY1106),  bjad22  (FY1209),  and  ^rhp51  (FY1884)  strains  were  grown  in  liquid  cultures  at  the  permissive  temperature  (25°C)  in  the  presence  of 
15  mM  HU.  Aliquots  of  cells  were  plated  to  YES  medium  at  the  indicated  time  points  and  incubated  at  25°C  for  4  days;  relative  viability  was 
calculated  as  the  number  of  colonies  at  each  time  point  relative  to  the  number  at  the  zero  time  point.  (C)  HR  mutants  generate  fewer 
phospho-H2A  damage  foci  than  checkpoint  mutants,  suggesting  that  most  replication  forks  are  stabilized.  Asynchronous  cultures  of  wild-type 
(FY72),  ^cdsl  (FY865),  \rad3  (FY1106),  ^rad22  (FY1209),  and  Arhp51  (FY1884)  cells  grown  at  25°C  were  treated  with  15  mM  HU  for  4  h  and 
then  released  into  fresh  medium  without  HU.  At  the  indicated  time  points,  spread  nuclei  were  prepared  and  immunostained  for  phospho-H2A. 
Phospho-H2A  focus  formation  in  the  \rad3  mutant  is  due  to  activity  of  the  Tell  protein  kinase,  which  acts  redundantly  with  Rad3  (54,  83)  (see 
Fig.  SI  in  the  supplemental  material).  (D)  Rad22-YFP  foci  form  in  cells  recovering  from  HU-induced  arrest.  Asynchronous  (asynch)  cultures  of 
wild-type  (FY72)  and  Rad22-YFP  (FY2878)  cells  were  incubated  with  20  mM  HU  for  4  h  at  25°C.  At  0,  2,  and  4  h  after  the  addition  of  HU, 
Rad22-YFP  localization  was  examined  and  quantitated  in  live  cells  (FY2878).  (E)  Rhp51  localization  was  assayed  by  anti-Rhp51  immunostaining 
of  spread  nuclei  prepared  from  wild-type  cells  (strain  FY72)  treated  with  HU  for  0,  2,  or  4  h.  The  percentage  spread  nuclei  that  contained  Rhp51 
nuclear  foci  is  indicated  in  the  text.  DNA  was  counterstained  with  DAPI.  Scale  bar,  10  |jlM. 


Arat/5  and  /^cdsl  mutant  cells,  respectively,  displayed  the  “cell 
untimely  torn”  (cut)  phenotype,  where  the  cells  divided  before 
replication  was  completed.  Cell  division  was  confirmed  by  im¬ 
munostaining  for  tubulin  to  detect  the  mitotic  spindles  (data 
not  shown). 

In  contrast,  the  /^rhp51  and  ^rad22  cells  elongated  and  ar¬ 
rested  in  HU  and  completed  bulk  DNA  synthesis  upon  release 
from  HU  (Fig.  5B),  consistent  with  the  findings  of  a  recent 
report  (48).  These  results  indicate  that  the  recombination  mu¬ 
tant  cells  were  competent  to  activate  the  checkpoint  as  well  as 
to  resume  replication  during  recovery.  However,  we  found  that 
approximately  25%  of  HR  mutant  cells  missegregated  chro¬ 
mosomes  in  the  subsequent  mitosis  (Fig.  5B).  In  contrast  to  the 


checkpoint  mutants,  the  main  type  of  chromosome  missegre- 
gation  in  HR  mutants  was  unequal  division  of  chromosomes  or 
chromatids  (82%  of  ^rhp51  cells  and  79%  of  b^md22  cells), 
suggesting  a  defect  in  spindle  attachment,  sister  chromatid 
cohesion,  or  segregation.  Interestingly,  Meister  et  al.  (48)  re¬ 
ported  that  the  deletion  of  rhp31  ^  allowed  some  ^cdsl  cells  to 
complete  bulk  DNA  replication,  suggesting  that  one  function 
of  HR  proteins  during  replication  arrest  and  recovery  is  the 
resolution  of  nonproductive  DNA  structures  that  form  as  a 
consequence  of  replication  fork  stalling  and  collapse.  We  ob¬ 
served  that  ^cdsl  /Srhp51  double  mutants  were  heterogeneous 
in  colony  size.  During  acute  HU  exposure  and  release,  the 
double  mutant  was  as  inviable  as  the  ^cdsl  single  mutant 
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FIG.  5.  HR  proteins  have  a  role  in  recovery  from  HU  distinct  from  that  in  the  checkpoint.  Asynchronous  cultures  of  wild-type  (FY72),  t^cdsl 
(FY865),  /Srad3  (FY1106),  /Srad22  (FY1209),  and  Arhp51  (FY1884)  cells  grown  at  25°C  were  treated  with  15  mM  HU  for  4  h,  then  released  into 
fresh  medium  without  HU,  and  grown  for  another  4  h  at  25°C.  Cells  were  analyzed  for  (A)  DNA  content  by  flow  cytometry  and  (B)  nuclear 
morphology  by  staining  with  Sytox  green.  Arrowheads  indicate  examples  of  chromosome  missegregation  events  (detailed  in  the  text);  averages  and 
standard  deviations  were  calculated  from  three  independent  experiments.  Scale  bar,  10  |jlM. 


alone,  but  during  chronic  exposure,  the  double  mutant  was 
more  sensitive  to  HU,  indicating  that  continued  lack  of  rhp51 
reduces  the  fitness  of  the  ^cdsl  mutant,  consistent  with  inde¬ 
pendent  effects  on  recovery  (see  Fig.  S4  in  the  supplemental 
material). 

Down-regulation  of  human  MCM  proteins  by  siRNA  results 
in  DNA  damage  and  cell  death.  MCM  proteins  are  highly 
overexpressed  in  many  human  cancers  and  have  also  been 
implicated  more  directly  in  cancer  etiology  (63,  77).  We  there¬ 
fore  tested  whether  MCM  proteins  promote  S-phase  genome 
stability  in  human  cells  by  using  siRNA  to  reduce  the  levels  of 
MCM4  and  MCM7  proteins  in  HeLa  cells  (Fig.  6A).  Cells  that 
were  mock  transfected  continued  to  grow  and  divide,  with  a 
265.0%  ±  60.2%  increase  in  cell  viability  by  48  h.  In  contrast, 
cells  transfected  with  MCM  siRNA  exhibited  a  modest  loss  of 
viability,  with  just  41.6%  ±  19.3%  (MCM4)  or  53.2%  ±  13.1% 
(MCM7)  of  the  cells  remaining  viable  after  48  h.  In  experi¬ 
ments  where  MCM  protein  levels  were  not  as  strongly  reduced, 
cell  growth  was  still  inhibited  relative  to  that  of  cells  treated 
with  either  mock  siRNA  or  nontargeting  control  siRNA,  al¬ 
though  the  overall  viability  of  the  population  of  cells  did  not 
decrease  (data  not  shown).  This  result  is  consistent  with  ob¬ 
servations  in  yeast  that  cells  with  a  modest  reduction  in  MCM 
protein  levels  still  undergo  bulk  DNA  replication,  whereas 
complete  depletion  of  MCM  proteins  is  needed  to  prevent 
DNA  synthesis  (24,  41).  In  our  experiments,  MCM4  or  MCM7 
siRNA  did  not  substantially  alter  the  distribution  of  G^,  S,  and 


G2/M  cells  as  analyzed  by  flow  cytometry  (Fig.  6B),  although  a 
higher  proportion  of  cells  displayed  abnormal  nuclear  mor¬ 
phology,  such  as  multinucleate  cells  or  micronuclei  (about  30% 
of  cells  treated  with  MCM4  siRNA  or  MCM7  siRNA  com¬ 
pared  to  about  10%  of  mock-treated  cells)  (Fig.  6C).  We  also 
found  that  the  siRNA  oligonucleotides  used  to  reduce  MCM7 
levels  also  partially  decreased  levels  of  MCM4,  suggesting  that 
loss  of  function  of  one  of  the  MCM  subunits  may  affect  the 
stability  of  other  components  of  the  MCM  complex.  We  con¬ 
clude  that  the  loss  of  MCM  function  in  HeLa  cells,  as  in  yeast, 
leads  to  changes  in  chromosome  structure  or  cell  cycle  pro¬ 
gression  that  result  in  cell  inviability. 

To  determine  whether  the  loss  of  viability  resulted  from 
DNA  damage,  we  assessed  the  focus  formation  of  phosphory- 
lated  histone  H2AX  (phospho-H2AX),  the  mammalian  coun¬ 
terpart  to  yeast  phospho-H2A  (64).  While  just  3.0%  ±  1.0%  of 
mock-treated  cells  contained  phospho-H2AX  foci  after  48  h 
(Fig.  6D),  78.0%  ±  2.8%  of  cells  treated  with  MCM4  siRNA 
and  75.0%  ±  3.0%  of  cells  treated  with  MCM7  siRNA  con¬ 
tained  phospho-H2AX  foci  (Fig.  6D).  To  investigate  whether 
this  DNA  damage  localized  to  replication  sites,  we  pulse-la¬ 
beled  siRNA-treated  cells  with  BrdU,  prepared  stretched  chro¬ 
matin  fibers,  and  immunostained  for  RadSl  (the  Rhp51 
homologue)  and  BrdU  (Fig.  6E).  We  found  that  67%  of  BrdU- 
stained  chromatin  fibers  from  cells  treated  with  MCM  siRNA 
also  displayed  RadSl  foci  and  that  the  RadSl  foci  overlapped 
with  BrdU  foci.  In  contrast,  just  8%  of  BrdU-stained  chroma- 
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FIG.  6.  siRNA  against  MCM4  or  MCM7  in  HeLa  cells  generates  DNA  damage  and  cell  inviability.  HeLa  cells  were  transfected  with  buffer  only 
(mock)  or  with  siRNA  against  MCM4  or  MCM7,  and  the  cells  were  analyzed  at  0  and  48  h  after  transfection.  (A)  Proteins  were  analyzed  by  10% 
SDS-PAGE,  followed  by  immunoblotting  (IB)  with  antibodies  to  detect  MCM4  or  MCM7,  and  subsequently  immunoblotted  with  anti-PCNA 
antibody  as  a  loading  control.  (B)  DNA  content  was  assessed  by  flow  cytometry.  HeLa  cells  mock  treated  or  treated  with  MCM4  siRNA  or  MCM7 
siRNA  for  48  h  were  stained  with  (C)  DAPI  to  visualize  the  DNA,  and  (D)  phospho-H2AX  and  DAPI  to  detect  DNA  damage.  (E)  The  DNA 
damage  correlates  with  loss  of  MCM  localization  and  corresponds  to  sites  of  new  DNA  synthesis.  Stretched  chromatin  flbers  prepared  from  HeLa 
cells  48  h  after  siRNA  treatment  were  immunostained  for  MCM7  and  phospho-H2AX  and  counterstained  with  DAPI  (left);  chromatin  flbers 
prepared  from  cells  pulse-labeled  with  BrdU  for  1  h  were  immunostained  for  BrdU  and  Rad51  and  counterstained  with  DAPI.  Scale  bar,  10  ijum 
(approximately  20  kb).  (F)  MCM  siRNA  results  in  cell  death  by  apoptosis.  Cells  mock  treated  or  treated  with  siRNA  against  MCM4  or  MCM7 
were  immunostained  for  cleaved  caspase  3  and  counterstained  with  DAPI.  Scale  bars,  10  iJum. 


tin  fibers  from  mock- treated  cells  also  contained  RadSl  foci. 
We  suggest  that  as  in  yeast,  the  loss  of  MCM  function  in  HeLa 
cells  is  associated  with  DNA  breaks  at  sites  of  DNA  synthesis. 
Moreover,  the  DNA  damage  resulted  in  cell  death  by  apop¬ 
tosis,  as  cells  treated  with  siRNA  against  MCM4  or  MCM7, 
but  not  mock-treated  cells,  stained  for  cleaved  caspase  3,  a 
marker  for  activation  of  the  apoptotic  pathway  (Fig.  6F).  We 
observed  similar  phenotypes  of  DNA  damage  and  cell  death  by 
using  a  pool  of  four  additional  siRNA  duplexes  directed 
against  either  MCM4  or  MCM7,  but  not  with  a  nontargeting 
siRNA  duplex  (data  not  shown),  indicating  that  the  pheno¬ 
types  we  observed  are  due  to  specific  depletion  of  these  pro¬ 
teins.  These  findings  support  a  role  for  the  MCM  proteins  in 


S-phase  genome  stability  that  is  conserved  from  yeast  to  hu¬ 
man  cells. 

Interactions  between  MCM  proteins  and  RadSl  in  human 
cells.  In  fission  yeast  cells  treated  with  HU,  MCM  proteins 
interact  with  the  RadSl  homologue  RhpSl  (Fig.  5)  and  this 
association  may  be  important  for  cell  survival  of  replication 
arrest.  To  test  whether  this  interaction  is  conserved  in  human 
cells,  we  examined  whether  endogenous  MCM  proteins  can 
coimmunoprecipitate  with  RadSl  in  HeLa  cells  treated  with 
HU.  We  found  that  the  interaction  between  MCM  proteins 
and  RadSl  in  HeLa  cells  was  not  specific  to  cells  treated  with 
HU;  it  also  occurred  in  asynchronous  cultures,  as  reported 
recently  (70),  as  well  as  in  cultures  of  cells  released  synchro- 
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FIG.  7.  MCM  proteins  interact  with  Rad51  in  HeLa  cells.  (A)  Whole-cell  lysates  prepared  from  asynchronous  cultures  of  HeLa  cells  (asynch), 
cells  blocked  in  2  mM  thymidine  for  18  h  and  then  released  into  S  phase  for  3  h  (S  phase),  and  cells  blocked  in  early  S  phase  with  2.5  mM  HU 
for  18  h  (HU)  were  immunoprecipitated  with  anti-Rad51  antibodies  or  with  an  irrelevant,  control  antibody  (anti-myc).  The  anti-myc  immuno- 
precipitation  (IP)  was  performed  on  cells  blocked  in  HU.  Rad51  and  associated  proteins  were  analyzed  by  10%  SDS-PAGE,  followed  by 
immunoblotting  to  detect  Rad51  and  Mcm7.  (B)  Coimmunoprecipitation  of  MCM7  with  Rad51  from  HeLa  cells  is  not  dependent  on  DNA. 
Anti-Rad51  antibodies  immunoprecipitate  Rad51  and  MCM7  from  both  untreated  HeLa  cell  lysates  and  lysates  that  were  treated  with  ethidium 
bromide  (EtBr).  AS,  asynchronous  cells;  S,  cells  released  from  thymidine  block  into  S  phase;  HU,  cells  arrested  in  hydroxyurea.  C,  control  IP 
(anti-myc). 


nously  into  S  phase  from  a  thymidine  block  (Fig.  7A).  Further¬ 
more,  the  association  between  RadSl  and  Mcm7  was  still  ob¬ 
served  when  immunoprecipitations  were  carried  out  in  the 
presence  of  ethidium  bromide  (Fig.  7B)  or  DNase  (data  not 
shown),  indicating  that  the  interaction  was  due  to  protein-to- 
protein  associations  and  not  mediated  by  DNA.  However, 
while  we  observed  nuclear  MCM7  localization  in  S-phase  cells 
both  in  the  absence  and  the  presence  of  HU,  Rad51  nuclear 
foci  were  found  primarily  in  S-phase  cells  that  had  been  treated 
with  HU  (data  not  shown).  Thus,  while  MCM  proteins  interact 
with  Rad51/Rhp51  in  both  human  and  yeast  cells,  the  interac¬ 
tion  in  human  cells  does  not  appear  to  be  driven  by  activation 
of  the  replication  checkpoint.  However,  it  is  possible  that  the 
interaction  between  MCM  proteins  and  RadSl  in  untreated 
cells  might  reflect  a  role  in  response  to  spontaneous  S-phase 
DNA  damage  or  stalled  replication  forks  that  may  occur  more 
frequently  in  the  human  genome. 

DISCUSSION 

MCM  proteins  promote  replication  fork  stabilization  and 
recovery.  In  response  to  DNA  damage  or  blocks  to  DNA 
replication,  checkpoint  activation  promotes  replication  fork 
arrest  and  also  stabilizes  fork  structure  so  that  the  leading  and 
lagging  DNA  strands  and  associated  proteins  remain  coupled 
(12,  15,  34,  46).  In  this  study,  we  analyze  the  role  of  the  MCM 
complex  in  maintaining  genome  stability  both  during  normal  S 
phase  and  in  response  to  replication  fork  stalling  and  check¬ 
point  activation  caused  by  treatment  with  HU.  Using  fission 
yeast  as  a  model  system,  we  provide  evidence  for  distinct  steps 
in  the  recovery  process.  Once  the  replication  checkpoint  is 
activated,  MCM  proteins  are  required  to  maintain  the  integrity 
of  arrested  replication  forks.  MCM  proteins  are  also  needed 
for  cells  to  resume  synthesis  following  S-phase  arrest.  We  and 
others  have  identified  a  distinct  role  for  recombination  pro¬ 
teins  in  the  recovery  process  (this  work;  1,  48).  Here  we  show 
that  recombination  is  important  for  accurate  chromosome  seg¬ 
regation  in  the  subsequent  mitosis,  and  we  suggest  that  recom¬ 
bination  repairs  DNA  breaks  that  form  spontaneously  during 
replication  fork  arrest  and  restart.  Thus,  the  checkpoint,  MCM 
activity,  and  recombination  define  separable  events  that  con¬ 


tribute  to  S-phase  genome  stability  and  the  survival  of  repli¬ 
cation  arrest. 

Fission  yeast  mcm-ts  mutants  undergo  lethal  Chkl-depen- 
dent  arrest  even  though  cells  appear  to  complete  bulk  DNA 
synthesis  (23,  40,  41,  47).  Previously,  we  showed  that  wild-type 
MCMs  are  delocalized  from  the  chromatin  and  lost  from  the 
nucleus  when  mcm-ts  mutants  are  shifted  to  the  restrictive 
temperature  (61).  Here,  we  demonstrate  that  the  inactivation 
of  mcm-ts  also  generates  extensive  DNA  breaks  during  S  phase 
identified  by  extensive  phospho-H2A  staining,  and  we  ob¬ 
served  similar  results  for  human  cells  where  MCM  function  is 
inhibited  by  siRNA.  That  the  breaks  are  linked  to  active  DNA 
synthesis  and  replication  forks  is  suggested  because  the  breaks 
are  not  observed  in  a  mcm4ts-degron  mutant  that  blocks  rep¬ 
lication  elongation  and  because  the  DNA  damage  foci  are 
found  in  regions  of  newly  synthesized  DNA.  Thus,  we  propose 
that  the  loss  of  MCM  function  causes  replication  fork  break¬ 
down  that  generates  DNA  DSBs.  This  damage  may  be  limited 
to  certain  regions  of  the  genome  or  to  regions  of  replication 
termination,  and  genome-wide  analysis  will  be  required  to  dis¬ 
tinguish  where  it  occurs. 

Interestingly,  while  the  DNA  damage  in  mcm-ts  mutants 
activates  a  DNA  damage  checkpoint  and  results  in  the  recruit¬ 
ment  of  recombination  proteins,  such  as  Rhp51  and  Rad22,  to 
nuclear  foci,  the  DNA  breaks  apparently  remain  unrepaired. 
In  contrast,  in  prokaryotes,  broken  replication  forks  are  a  sub¬ 
strate  for  repair  by  recombination,  which  reassembles  the 
replisome  at  the  fork  and  promotes  restart  of  synthesis  (re¬ 
viewed  in  references  27  and  50).  One  possibility  is  that  the 
amount  of  DNA  damage  generated  in  mcm-ts  mutants  is  ex¬ 
tensive  and  may  exceed  the  repair  capacity  of  the  cell.  In 
support  of  this  model,  we  observed  that  the  overproduction  of 
Rad22  in  mcm-ts  cells  improved  cell  viability;  however,  the 
overproduction  of  Rad22  also  caused  a  delay  in  cell  cycle 
progression,  suggesting  that  fewer  DNA  breaks  were  formed  at 
any  given  time  (J.  M.  Bailis  and  S.  L.  Forsburg,  unpublished 
data). 

MCM  proteins  have  checkpoint-dependent  and  -indepen¬ 
dent  roles  in  genome  stability.  We  also  And  that  MCM  pro¬ 
teins  contribute  to  fork  stability  and  repair  during  replication 
arrest  induced  by  HU.  While  cells  blocked  in  HU  display  little 
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or  no  DNA  DSBs,  as  assessed  by  phospho-H2A  focus  forma¬ 
tion,  extensive  DNA  breakage  results  when  MCM  function  is 
subsequently  lost  from  stalled  forks  using  the  mcm4ts-degron 
allele.  This  result  indicates  that  the  MCM  complex  is  required 
to  maintain  fork  stability  in  HU  arrest,  even  after  the  replica¬ 
tion  checkpoint  kinase  Cdsl  has  been  activated.  Unexpectedly, 
we  found  that  the  Mcm4ts  (nondegron)  protein  retained  its 
function  following  HU  arrest,  correlating  with  Cdsl-dependent 
phosphorylation.  In  contrast,  the  mcm4ts-degron  strain  is  HU 
sensitive  even  at  the  permissive  temperature  and  shows  re¬ 
duced  modification  of  the  Mcm4ts-degron  protein.  Together, 
these  data  raise  the  possibility  that  Mcm4  is  a  Cdsl  substrate. 
Two  motifs  for  Cdsl  family  kinases  have  been  identified,  RXX- 
S/T  (58)  and  S/T-F  (71),  and  both  motifs  exist  in  the  N  termi¬ 
nus  of  SpMcm4.  In  vertebrates,  MCM  subunits  are  substrates 
of  the  checkpoint  protein  kinases  ATR  and  ATM  (17,  33,  67, 
82).  Consistent  with  a  role  for  checkpoint  kinases  in  regulating 
MCM  proteins,  in  S.  cerevisiae  rad53  or  mrcl  mutants,  MCM 
proteins  are  delocalized  from  the  replication  fork  in  HU  (16). 
Taken  together,  these  data  suggest  the  MCM  complex  is  a 
proximal  effector  of  the  Cdsl  checkpoint,  is  required  for  rep¬ 
lication  fork  stability,  and  may  be  a  direct  Cdsl  substrate. 

An  alternative  possibility  is  that  mcm4ts  mutants  recover 
from  HU  because  all  replication  origins  have  already  fired.  In 
this  case,  Mcm4  might  normally  regulate  the  timing  of  repli¬ 
cation  origin  firing  and  no  additional  origins  would  need  to 
initiate  after  the  cells  are  released  from  HU.  However,  pre¬ 
liminary  data  suggest  that  some  origins  that  normally  fire  early 
do  not  fire  in  mcm4ts  (D.  D.  Luche  and  S.  L.  Forsburg,  un¬ 
published  data).  Moreover,  when  the  number  of  active  origins 
is  reduced  in  budding  yeast,  such  as  by  mutation  of  the  Cdc7 
kinase  or  MCM  subunits,  cells  can  still  complete  S  phase, 
although  more  slowly  (9,  19,  28).  In  addition,  our  comparison 
of  mcm4ts  mutants  and  the  mcm4-degron  mutant  shifted  to  the 
restrictive  temperature  suggests  that  the  damage  does  not  oc¬ 
cur  during  replication  initiation  (Fig.  1). 

In  many  respects,  mcm-ts  mutants  resemble  ^cdsl  mutants 
treated  with  HU:  both  result  in  the  formation  of  phospho-H2A 
foci  and  the  loss  of  viability,  suggesting  that  both  cause  repli¬ 
cation  fork  collapse  (Fig.  1  and  4)  (11,  48,  57).  However,  there 
are  important  differences.  Although  mutants  treated 

with  HU  activate  Chkl  and  the  DNA  damage  checkpoint,  cell 
cycle  arrest  is  transient  (11)  and  the  cells  undergo  a  lethal 
mitosis  without  completing  S  phase  (11,  48,  57)  (Fig.  4).  More¬ 
over,  Ac(i57  cells  in  HU  retain  the  capacity  to  resume  synthesis, 
because  at  least  some  cells  can  complete  bulk  DNA  replication 
if  rhp51^  is  deleted  (48).  In  contrast,  mcm-ts  mutants  undergo 
Chkl -dependent  arrest  that  is  irreversible  (23,  41,  47)  and  the 
arrest  is  not  suppressed  by  the  deletion  of  rhp51^  or  rad22^ 
(J.  M.  Bailis  and  S.  L.  Forsburg,  unpublished  data).  These  data 
suggest  that  while  Cdsl  activity  is  essential  to  protect  the  rep¬ 
lication  fork,  possibly  in  part  by  stabilizing  the  MCM  proteins, 
MCM  proteins  may  play  additional  downstream  roles  in  recov¬ 
ery  and  reentry  into  the  cell  cycle  that  are  Cdsl  independent. 

Interactions  between  MCM  proteins  and  recombination 
during  recovery.  We  found  that  fission  yeast  MCM  proteins 
interact  with  Rhp51  specifically  during  HU  arrest,  while  hu¬ 
man  MCM  proteins  associate  with  Rad51  both  in  the  presence 
and  the  absence  of  HU  (70;  this  work),  suggesting  that  one 
such  additional  role  in  recovery  may  be  the  recruitment  of 


recombination  factors  to  forks  that  break  during  recovery.  An 
analysis  of  Arhp51  or  ^rad22  mutants  following  release  from 
HU  suggests  that  cells  are  proficient  to  resume  DNA  synthesis, 
but  defective  in  chromosome  segregation.  While  the  precise 
defect  has  not  been  determined,  this  result  is  reminiscent  of 
those  of  recent  studies  of  budding  yeast  that  suggest  recombi¬ 
nation  proteins,  such  as  the  MRN  complex  and  Rad52,  are 
required  to  tether  broken  ends  together  to  facilitate  proper 
segregation  in  mitosis  (35,  45)  and  suggests  that  the  role  of  HR 
proteins  in  HU  recovery  may  be  a  structural  function  linked  to 
replication  fork  restart  that  is  required  for  proper  chromosome 
segregation.  In  support  of  this  idea,  S.  cerevisiae  Rad51  is 
needed  to  maintain  the  association  of  DNA  polymerase  e  with 
the  replisome  during  arrest  (6). 

However,  recombination  could  also  promote  recovery  by 
resolving  DNA  intermediates.  While  Sp  ^rhp51  mutants  com¬ 
plete  bulk  DNA  replication  after  release  from  HU,  as  assessed 
by  fiow  cytometry  (48;  this  work),  the  chromosomes  fail  to 
enter  pulsed-field  gels  (52),  suggesting  that  they  contain  non¬ 
linear  DNA  intermediates.  While  at  least  some  of  these  inter¬ 
mediate  structures  are  associated  with  DSBs  (this  work),  they 
do  not  activate  a  G2/M  damage  checkpoint.  Although  a  similar 
phenotype  was  reported  for  other  recombination  mutants, 
such  as  rad60  (52)  and  smc6  (1),  those  mutants  may  not  have 
activated  the  checkpoint  because  the  cells  arrested  prior  to 
mitosis.  Thus,  while  the  DNA  damage  in  HR  mutants  does  not 
activate  a  checkpoint  leading  to  G2/M-phase  arrest,  the  breaks 
may  cause  uneven  separation  and  segregation  of  the  DNA 
during  mitosis.  It  remains  to  be  determined  whether  the  role  of 
HR  proteins  in  recovery  represents  a  requirement  for  active 
recombination  or  a  structural  role  in  maintaining  chromosome 
integrity  as  observed  in  DSB  repair  mutants  in  budding  yeast 
(35,  45).  Alternatively,  DNA  breaks  in  ^rhp51  cells  may  not  be 
in  a  chromatin  context  that  can  be  recognized  by  the  check¬ 
point.  This  model  is  supported  by  evidence  that  some  2srhp51 
IHscdsl  cells  can  restart  replication  and  suggests  that  recombi¬ 
nation  proteins  may  function  both  to  repair  damaged  forks  and 
to  resolve  DNA  intermediates  formed  when  forks  stall  (re¬ 
viewed  in  references  10  and  38). 

The  number  of  Rhp51  and  Rad22  foci  that  we  observe  in 
cells  treated  with  HU  is  far  fewer  than  the  predicted  number  of 
active  forks  after  early  firing  of  origins  in  HU  (21),  consistent 
with  the  possibility  that  recombination  proteins  only  associate 
with  or  accumulate  at  a  subset  of  replication  forks  or  that 
damaged  forks  are  assembled  into  repair  centers  as  suggested 
for  multiple  DSBs  in  S.  cerevisiae  (43).  We  suggest  that  the 
interaction  of  Rhp51  with  MCM  proteins  during  replication 
fork  arrest  and  recovery  may  be  important  to  maintain  fork 
stability  during  replication  restart.  However,  in  mcm-ts  cells, 
the  recruitment  of  Rhp51  to  DNA  breaks  may  instead  refiect 
an  MCM-independent  attempt  to  repair  forks  that  have  al¬ 
ready  broken  but  cannot  be  processed  appropriately.  Thus,  we 
speculate  there  are  two  forms  of  HR  protein  recruitment  dur¬ 
ing  S  phase:  fork  recovery,  via  MCM  proteins,  and  DSB  repair, 
which  is  MCM  independent. 

MCM  proteins  may  also  contribute  to  an  analogous  function 
as  Swil  and  Swi3,  which  have  been  proposed  to  form  a  fork 
protection  complex  (57).  We  observed  that  fission  yeast 
mcm2ts  is  synthetically  lethal  with  mutants  of  swil  (W.  P. 
Dolan  and  S.  L.  Forsburg,  unpublished  data).  In  S.  cerevisiae. 
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MCM  proteins  physically  associate  with  the  Swil  homologue, 
Tofl  (55),  and  with  Mrcl,  a  Cdsl-adaptor  protein  which  local¬ 
izes  to  replication  forks  and  also  functions  as  a  replication 
processivity  factor  (55,  59,  74,  79).  In  HU-treated  Amrc7  or 
ditofl  cells,  replication  forks  do  not  collapse,  but  the  MCM 
helicase  becomes  separated  from  the  polymerase  on  the  chro¬ 
matin  as  the  leading  and  lagging  DNA  strands  become  uncou¬ 
pled  (16,  34,  78).  Although  Mrcl  contributes  to  checkpoint 
activation,  the  replisome  decoupling  in  mrcl  mutants  is  distinct 
from  the  irreparable  fork  collapse  of  rad53  (or  Sp  cdsl)  check¬ 
point  mutants  (16,  79).  Because  MCM  proteins  interact  with 
both  Mrcl  and  Cdsl/Rad53,  we  suggest  that  the  MCM  com¬ 
plex  contributes  to  S-phase  genome  stability  at  multiple  levels. 

The  dysregulation  of  checkpoint,  replication,  and  repair 
pathways  is  strongly  linked  with  the  capacity  for  proliferation 
and  development  of  cancer  (reviewed  in  references  25  and  80). 
The  MCM  complex  has  recently  been  used  as  a  molecular 
marker  for  malignant  cells  and  has  also  been  proposed  as  a 
target  for  cancer  therapy  (reviewed  in  reference  25).  The  cen¬ 
tral  role  of  the  MCMs  is  further  supported  by  the  recent 
identification  of  an  allele  of  Mcm4  linked  to  mouse  mammary 
adenocarcinoma  (68).  Additionally,  overexpression  of  the 
Mcm7  subunit  alone  can  accelerate  oncogene-induced  tumor- 
igenesis  (30).  It  will  be  important  to  determine  the  mechanisms 
by  which  interactions  between  the  MCM,  checkpoint,  and  re¬ 
combination  proteins  modulate  fork  arrest  and  restart  with 
repair  and  to  regulate  whether  S-phase  arrest  is  survivable  or 
lethal  to  cells. 
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ABSTRACT 

Schizosaccharomyces  pombe  Mstl  is  a  member  of  the  MYST  family  of  histone  acetyltransferases  and  is  the 
likely  ortholog  of  Saccharomyces  cerevisiae  Ls3l  and  human  TipGO  (KAT5).  We  have  isolated  a  temperature- 
sensitive  allele  of  this  essential  gene,  mstl  cells  show  a  pleiotropic  phenotype  at  the  restrictive  temperature. 
They  are  sensitive  to  a  variety  of  DNA-damaging  agents  and  to  the  spindle  poison  thiabendazole,  mstl  has  an 
increased  frequency  of  Rad22  repair  foci,  suggesting  endogenous  damage.  Two-hybrid  results  show  that 
Mstl  interacts  with  a  number  of  proteins  involved  in  chromosome  integrity  and  centromere  function, 
including  the  methyltransferase  Skbl,  the  recombination  mediator  Rad22  (Sc  Rad52),  the  chromatin 
assembly  factor  Hipl  (Sc  Hirl),  and  the  Mscl  protein  related  to  a  family  of  histone  demethylases.  mstl 
mutant  sensitivity  to  hydroxyurea  suggests  a  defect  in  recovery  following  HU  arrest.  We  conclude  that  Mstl 
plays  essential  roles  in  maintenance  of  genome  stability  and  recovery  from  DNA  damage. 


Many  events  in  DNA  metabolism,  including  repli¬ 
cation  and  repair,  are  modulated  by  the  pattern 
of  different  histone  modifications  in  the  chromatin, 
leading  to  the  speculation  that  a  “histone  code”  pro¬ 
vides  epigenetic  information  that  facilitates  numerous 
J]  chromatin  functions  (reviewed  in  Jenum^ein  and  Allis 
2001;  Kouzarides  2007).  Histone  acetylation  is  associ¬ 
ated  with  diverse  chromatin  functions  (reviewed  in 
Kouzarides  2007;  Lee  and  Workman  2007).  Histone 
acetylation  changes  association  of  DNA  with  the  under¬ 
lying  nucleosomes  (Shogren-Knaak  et  al  2006)  and 
also  creates  specific  binding  sites  for  proteins  involved 
in  a  variety  of  DNA  transactions  (reviewed  in  Kouzarides 
2007;  Lee  and  Workman  2007).  Histone  acetyltrans¬ 
ferase  (HAT)  enzymes  can  be  separated  into  several 
sequence-specific  subfamilies  that  differ  in  their  sub¬ 
strates  and  in  the  composition  of  their  associated  com¬ 
plexes.  The  MYST  family  of  HATs  (reviewed  by  Utley 
and  Cote  2003)  has  been  extensively  characterized  in 
budding  yeast,  which  has  three  members  (reviewed  in 
Laeon  et  al  2007) .  We  have  reported  two  MYST  family 
proteins  in  fission  yeast  (Gomez  et  al  2005):  Mst2,  the 
likely  ortholog  of  Saccharomyces  cerevisiae  Sas2,  is  a 
nonessential  gene  required  for  normal  telomere  silenc¬ 
ing  (Gomez  et  al  2005).  Mstl,  the  likely  ortholog  of  S. 
cerevisiae  Esal,  is  an  essential  gene  and  is  the  subject  of 
this  report. 

^  Corresponding  author:  Molecular  and  Computational  Biology  Section, 
1050  Childs  Way,  RRI 201,  University  of  Southern  California,  Los  Angeles, 
CA  90089-2910.  E-mail:  forsburg@usc.edu 


Work  from  several  experimental  systems  shows  that 
ScEsal  histone  acetyltransferase  is  a  catalytic  subunit  of 
a  large  protein  complex  called  NuA4  and  a  smaller 
subcomplex.  Piccolo,  that  is  part  of  NuA4  (Allard  et  al 
1999;  Boudreault  et  al  2003;  Eberharter  et  al  2005; 
Selleck  et  al  2005).  The  NuA4  complex  acetylates 
histone  H4  and  H2A  (Smith  et  al  1998;  Allard  et  al 
1999;  Galarneau  et  al  2000).  It  may  also  be  associated 
with  acetylation  of  the  histone  variant  H2AZ  (Keogh 
et  al  2006;  Millar  et  al  2006) .  Esal  family  HATs  contain 
a  chromodomain,  a  motif  associated  with  binding  to 
methylated  histones  (reviewed  in  de  la  Cruz  et  al 

2005) .  While  this  suggests  that  some  aspect  of  Esal 
function  is  mediated  by  binding  methylated  histone  (s), 
the  target  for  such  binding  remains  elusive.  The  mam¬ 
malian  homolog  is  Tip60  (reviewed  in  Squatrito  et  al 

2006) .  Under  a  new  initiative  to  regularize  nomencla¬ 
ture  in  the  chromatin  field,  Mstl,  Esal,  and  Tip60  are 
defined  as  the  KAT5  family  of  histone  acetyltransferases 
(Allis  et  al  2007) . 

ScESAl  is  an  essential  gene,  and  loss  of  function 
results  in  cell-cycle-specific  arrest  after  replication  but 
before  mitosis  (Smith  et  al  1998;  Clarke  et  al  1999), 
although  a  recent  report  suggests  that  it  may  also  be 
required  for  cell  cycle  entry  (Early  et  al  2004).  The 
roles  of  5cEsal  are  diverse,  and  recent  studies  have  also 
linked  it  to  the  spindle  assembly  checkpoint  (Le 
Masson  et  al  2003)  and  the  morphogenesis  checkpoint 
(Le  Masson  et  al  2003;  Ruault  and  Pillus  2006). 
Genome-level  analyses  in  budding  yeast  indicate  that 
5cEsal  activity  is  associated  with  active  gene  expression 
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89  (Galarneau  et  al  2000;  Vogelauer  et  al  2002).  Para- 

90  doxically,  however,  H4  acetylation  is  also  associated  with 

91  decreased  gene  expression  (Deckert  and  Struhl  2001). 

92  Recent  studies  suggest  that  5cEsal  may  also  repress  activity 

93  of  specific  gene  regions,  particularly  the  heterochromatic 

94  domains  of  the  telomeres  and  the  rDNA  (Clarke  et  al 

95  2006). 

96  Histone  acetylation  by  the  NuA4  complex  has  a  role  in 

97  DNA  double-strand  break  (DSB)  repair  (reviewed  in 

98  VAN  Attikum  and  Gasser  2005b;  Squatrito  et  al 

99  2006).  Increased  histone  acetylation  near  the  damage 

100  site  occurs  (Yu  al  2005)  and  acetylation  of  H4  is  linked 

101  to  activation  of  homologous  recombination  and  non- 

102  homologous  end-joining  (Bird  et  al  2002;  Tamburini 

103  and  Tyler  2005).  5cEsal  is  recruited  to  DNA  breaks 

104  (Bird  et  al  2002;  Tamburini  and  Tyler  2005).  One  of 

105  the  major  responses  to  DNA  double-strand  breaks  is 

106  phosphorylation  of  the  histone  variant  H2AX  by  the 

107  damage  checkpoint  kinases  ATM  and  ATR  (reviewed  in 

108  Foster  and  Downs  2005;  Morrison  and  Shen  2005). 

1 09  H2AX  phosphorylation  and  recruitment  of  NuA4  occur 

110  with  similar  timing  (Downs  et  al  2004).  Interestingly, 

111  two  chromatin-remodeling  complexes,  INO80  and 

112  SWRI,  are  also  recruited  to  H2AX,  at  least  in  part 

113  through  the  common  Arp4  subunit  that  they  share  with 

114  NuA4  (Downs  et  al  2004;  Robert  et  al  2006).  NuA4, 

115  INO80,  and  SWRI  are  proposed  to  facilitate  exchange 

116  of  phospho  H2AX,  ultimately  downregulating  the 

117  damage  signal  (Morrison  et  al  2004;  Tsukuda  et  al 

118  2005;  VAN  Attikum  and  Gasser  2005b;  Papamichos- 

119  Chronakis  et  al  2006).  Recent  studies  have  implicated 

120  the  NuA4  complex  in  regulation  and  modification  of  an 

121  additional  histone  H2A  variant  called  H2AZ  (Babiarz 

122  et  al  2006;  Keogh  et  al  2006;  Millar  and  Grunstein 

123  2006) .  It  has  been  suggested  that  the  downregulation  of 

124  the  H2AX  phosphorylation  signal  reflects  exchange  of 

125  phospho-H2AX  with  H2AZ  (reviewed  in  van  Attikum 

126  and  Gasser  2005a). 

127  Tip60,  the  human  ortholog  of  MstI  and  5cEsaI, 

128  acetylates  nonhistone  proteins  independently  of  the 

129  NuA4  complex  with  substrates  including  Notch  (Kim 

130  et  al  2007),  ATM  (Sun  et  al  2005;  Jiang  et  al  2006),  p53 

131  (Sykes  et  al  2006;  Tang  et  al  2006),  and  DNA-PKcs 

132  (Jiang  et  al  2006).  Tip60-deficient  cells  exposed  to 

133  ionizing  radiation  have  defects  in  chromosome  repair 

134  and  reduced  apoptosis  (Ikura  et  al  2000) .  This  leads  to 

135  the  important  conclusion  that  HATs  are  not  limited  to 

136  histones  as  substrates.  These  data  suggest  that  KAT5 

137  family  HATs  contribute  to  DNA  damage  response  in  two 

138  ways:  first,  by  affecting  DNA  repair  and  recovery  {e.g., 

139  through  histone  modifications  that  recruit  specific  DNA 

140  damage  response  proteins)  and,  second,  by  affecting 

141  the  checkpoint  response  {e.g,  through  direct  modifica- 

142  tion  of  p53  or  ATM) . 

1 43  To  gain  further  insight  into  the  diverse  function  of  the 

144  KAT5  protein  family  in  DNA  metabolism,  we  investi- 

145  gated  the  role  of  Schizosaccharomyces  pombe  MstI  in  the 


tractable  fission  yeast  system.  In  this  report,  we  present 
initial  genetic  characterization  of  this  gene.  MstI  is 
essential  for  viability,  and  mutants  are  sensitive  to  a 
variety  of  DNA-damaging  agents,  including  hydroxy¬ 
urea  (HU),  ultraviolet  (UV)  irradiation,  methyl  meth- 
anesulfonate  (MMS),  and  bleomycin.  Mutants  are  also 
sensitive  to  the  spindle  poison  thiabendazole.  Using  a 
two-hybrid  screen,  we  identify  a  number  of  potential 
interaction  proteins  involved  in  DNA  dynamics,  includ¬ 
ing  chromatin  proteins,  transcription  factors,  and  the 
recombination  protein  Rad22  (ScRad52).  Analysis  of 
the  HU  response  suggests  that  MstI  is  required  for 
normal  recruitment  of  Rad22  in  HU.  Thus,  MstI  has 
multiple  roles  in  maintenance  of  genome  integrity  in 
S.  pombe. 

MATERIALS  AND  METHODS 

Strains,  media,  and  manipulations:  Strains  used  in  this  study 
are  listed  in  Table  I.  Strains  were  grown  and  maintained  on 
yeast  extract  plus  supplements  (YES)  or  Edinburgh  minimal 
media  with  appropriate  supplements,  using  standard  techni¬ 
ques  (Eorsburg  and  Rhind  2006).  Matings  were  performed 
on  synthetic  sporulation  agar  plates  for  2-3  days  at  25°.  Tem¬ 
perature-sensitive  (ts)  cells  were  grown  at  25°  and  non-ts  cells 
at  32°.  Transformations  were  carried  out  by  electroporation. 
Double  mutants  were  constructed  by  standard  tetrad  analysis 
or  random  spore  analysis.  G4I8  plates  were  YES  supplemented 
with  100  fxg/ml  G4I8  (Sigma,  St.  Louis). 

Plasmids  and  constructions:  Disruption  and  tagging  of 
mstl+  is  described  in  Gomez  et  al  (2005).  Site-directed  mu¬ 
tants  were  constructed  using  the  Quick-Change  mutagenesis 
kit  (Stratagene,  La  Jolla,  CA). 

Primers  used  were  L-S/E  (CATACAATTTTCTTATGAATC 
GACCAAGCGTGAGCACAAAC)  and  L-S/R  GTTTGTGCTCA 
CGCTTGGTCGATTCATAAGAAAATTGTATG  and  L-P/E  GG 

AC  and  L-P/R  GTGATCCAGA\AAAGTTTGGACGGAAGA 
CAAArATTCCTACACC.  Complementation  and  temperature 
sensitivity  were  assessed  by  transforming  the  expression  plas¬ 
mid  into  the  heterozygous  mstl^ /^mst::ura4+  diploid  FYI754 
and  by  isolating  haploids  that  contain  both  Amstl : : ura4+  and 
the  plasmid  markers.  An  integrant  was  constructed  by  moving 
the  Pnmt-mstl-L344S  cassette  from  plasmid  pEBG99  into  plas¬ 
mid  pJKI48  to  create  plasmid  pEBG98  and  integrating  this 
construct  at  the  leu  1-3 2  locus  in  the  diploid.  Haploid  temper¬ 
ature-sensitive  segregants  containing  both  ^mstl::ura4+ 
and  leul^ :: nmt-mstl-L344S  were  isolated  by  random  spore 
analysis. 

Silencing  was  assessed  by  examining  growth  in  the  absence 
of  uracil  and  low  adenine  in  a  strain  with  ura4^  integrated  into 
the  centromere  and  ade6^  integrated  into  the  telomere  of  a 
minichromosome. 

FACS:  Samples  were  collected  after  5  and  10  hr,  ethanol 
fixed,  and  analyzed  by  flow  cytometry  as  described  previously 
(Gomez  and  Eorsburg  2003). 

Immunoprecipitation:  Samples  were  prepared  as  described 
in  Gomez  et  al  (2002) .  Eor  co-immunoprecipitation  with  MCM 
proteins,  we  used  anti-HA  antibody  I2CA5  (Abeam)  and 
antiV5  (Invitrogen,  San  Diego).  Eor  histone  acetylation  assays, 
the  dried  beads  were  treated  with  20  fxl  of  a  histone  acetylation 
mix  containing  5  fiM  CI4-Acetyl  CoA  (ICN),  0.7  fxg  of  core 
histones,  50  mM  HEPES  (pH  7.9),  10%  glycerol,  and  I  mM 
DTT.  The  reaction  was  incubated  I  hr  30  min  at  30°.  Beads 
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TABLE  1 

Strains  used  in  this  study 


Strain 


Genotype 


Source 
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261 

865 

1104 

1107 

1256 

1552 

1754 

2180 

2192 

2217 

2339 

2396 

2449 

2450 
2495 

2498 

2499 
2501 

2505 

2506 
2508 
2525 
2535 
2562 
2566 

2569 

2570 
2572 
2574 
2576 
2578 
2601 
2555 
3135 
3286 
3369 
3436 

3479 

3480 
3553 

3563 

3562 


hr  ade6-M210 

hr  ura4-D18  leul-32  ade6-M210  can  1-1 
hr  /^cdslr.urad  ura4-D18  leul-32 
hr  lS.rad3 : : ura4^  ura4  leul-32  ade6-M210  (?) 
hr  lS.rad3 : :  ura4^  ura4-D18  leul-32  ade6-M216 
hr  lS.mad2::ura4^  ade6M210  leul-32  ura4D18 
hr  Lhuhlr-ura4^  ade6M210  leul-32  ura4D18 
/hr  IS^mstl :: ura4^ ura4-D18/ura4-D18  leul-32/leul-32  ade6-M216/ade6-M210 
r  cbhl::GFP-pBSIIKS-ura4^-chhlC  leul-32  um4-D18 
hr  la^chkl : : ura4^  ura4-D18  ade6-M216 
hr  ^cbhl  y. LEU2  ura4-D18  ade6-M216  leul-32 

A  mstl : :  ura4^  leul^ : :  mstl -L344S  ura4-Dl  8  ade6-M21 6  (#7) 
hr  l^mstl  y.  ura4^  leul^ :: mstl-L344S  ura4-D18  ade6-M210 
Amstl  ykanMX6  leul^  y  mstl -L3 448^  ura4-D18  ade6-M210 
hr  Xmstl  y.kanMX6  leul^  y.mstl-L344S^  ura4-D18  ade6-M210 
h/  Xtazly-ura4^  Xmstl  y.kanMX6  leul^ :: mstl -L344Sura4-D  18  leul-32  ade6-M210 
hr  Xrhp31y.ura4^  Xmstl  y-kanMX6  leul^  y  mstl -L344Sura4-D  18  leul-32  ade6-M210 
Xrad50y-kanMX6  Xmstl  y.ura4^  leul^  y.mstl-L344S^ ura4-D18  leul-32  ade6-M21? 
Xrad32::ura4^  mstl y.kanMX6  leu  1^ :: mstl -L344S^  ura4-D18  leul-32  ade6-M21? 
h""  mst2yura4^  mstl  y.kanMX6  leul^  y.mstl-L344S^  ura4-D18  leul-32  ade6-M210 
hr  mst2::ura4^  mstl  y.kanMX6  leul^ :: mstl -L344S^  ura4-D18  leul-32  ade6-M210 
swi6y.ura4^  mstl  r.kanMXS  leu  1^  y  mstl -L344S^  ura4-D18  leul-32  ade6-M210 
hr  hskl-1312  mstlr.kanMXS  leu  1^ :: mstl -L344S^  leul-32  ade6-M21? 

Xmstl  ykanMX6  leul^ :: mstl -L3 448^  ade6-M210  ura4-D18 
Xradl2::ura4^  Xmstl  y-kanMX6  leul^  y.mstl-L3448^  ade6-M210  ura4-D18 
hr  A  clr4 : :  ura4^  A  mstl : :  kanMX6  leul^ : :  mstl -L3448^ 
ade6-M210  ura4-D18  (otrR(8phI) :: ade6^ )  ? 

orpl-4  Amstl  yura4^  leul^ :: mstl-L3448^  ade6-M210  ura4-D18  (canl-1)? 
h""  Xclr3y.kanMX6  Xmstly.ura4^  leul^  y.mstl-L3448^  ade6-M210  ura4-D18  (canl-1)? 
h/  mcm7-98  Xmstly.ura4^  leul^  r.  mstl-L3448^  ade6-M216  ura4-D18  (canl-1)? 
cdcl9-Pl  Xmstly.ura4^  leu  1^  y  mstl -L3448^  ade6-M210  ura4-D18 
poll-1  Xmstl  y  ura4^  leul^  y.mstl-L3448^  ade6-M210  ura4-D18 
cdc21-M68  Amstl yura4^  leul^ :: mstl -L3 448^  ade6-M216  ura4-D18  (canl-1)? 
Xskbl  y.ura4^  mstl  y.kanMX6  leul^ :: mstl -L3 448^  ura4-D18  ade6-M210 
Xmstl  ::kanMX6  leul^ :: mstl -L3 448  ade6-M210  ura4-D18 
hr  ade6-M210  leul-32  ura4-Dl  Xhip::ura4^ 
hr  Xcdsl  y.  ura4^  rad22YFP::KanMX6  ura4-D18  leul-32  ade6? 

Xmstly.ura4^  leu  1^ :: mstl -L3 448  ura4-D18  ade6-M21?  rad22-YFP:kanMX4 
hr  rad22-YFP:kanMX4  ade6-M210  ura4-D18  leul-32 
A  mscl : :  kanMX6  leul-32 
mscl-HA-kanMX6  leul-32 

h?  A  mstl : :  kanMX6  leul^ : :  mstl  -L3448^  otrlL(  dh/H indlll) : :  ura4^ 
leul-32  ura4A8/E  ade6AN/N  [chl6  M23::LEU2^  telradeS^l 
h?  Acbhl  y.LEU2  Amstl  y kanMX6  leul^ :: mstl-L3448^  ade6-M21?  ura4-D18 
h?  Amstl  y.kanMX6  leul^  y.mstl-L3448^  ade6-M210  ura4-D18  Ahipy.ura4^ 
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were  spun  down,  and  supernatant  was  recovered  and  boiled  in 
SDS-Laemmli  loading  buffer  before  electrophoresis  in  15% 
SDS-polyacrylamide  gel  electrophoresis  (PAGE).  The  gel  was 
fixed  and  treated  with  fluorography-enhancing  solution  (NEN 
Life  Science)  before  drying  and  autoradiography. 

Eor  co-immunoprecipitation  with  Rad22-YFP  or  Cbhl-GEP 
(strains  3436,  3480),  we  prepared  soluble  protein  lysates  from 
fission  yeast  cells  in  B88  buffer  as  in  Moreno  et  al.  (1991)  A 
total  of  1.0  mg  protein  was  used  per  immunoprecipitation. 
Lysates  were  precleared  with  sepharose  A  beads  (Repligen)  for 
1  hr.  One  microliter  of  anti-GEP  (Abeam  290)  was  added  and 
left  to  rotate  overnight  at  4°.  Proteins  were  analyzed  by  8% 


SDS-PAGE,  followed  by  immunoblotting  with  anti-V5  (Invi- 
trogen)  at  1:2500  dilution  and  with  anti-GEP  (living  colors, 
Clontech)  at  a  1:1000  dilution.  Crosslinking  immunoprecipi- 
tations  were  done  as  previously  described  (Auger  et  al.  2008). 
Briefly,  cells  were  crosslinked  with  1%  formaldehyde  (Sigma) 
for  50  min  on  ice.  Harvested  cells  were  vortexed  with  glass 
beads  in  IP  buffer  (50  mM  HEPES-KOH,  pH  1.6,  500  mM 
NaCL,  1  mM  EDTA,  1%  Triton  X-100,  0.1%  sodium  deoxy- 
cholate) .  Extracts  were  treated  with  DNase  1  to  reverse  cross¬ 
link.  A  total  of  1.0  mg  protein  was  used  per  immunoprecipitation. 
Whole-cell  extracts  and  immunoprecipitates  were  incubated  with 
SDS  sample  buffer  for  1  hr  at  90°  before  Western  blot  analysis. 
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Microscopy:  Whole  fixed  cells  were  stained  as  described  for 
DAPI  and  Calofluor  (Gomez  and  Forsburg  2003).  Cells  were 
visualized  with  a  Leica  DMR  microscope.  Objectives  used  were 
Leica  lOOX/1.30,  PLFL,  NA  =  1.30,  and  Leica  63X/1.32, 
PLApo,  NA  =  1.32.  Images  were  captured  with  a  Hamamatsu 
digital  camera  and  Improvision  Openlab  software  (Improvi- 
sion,  Lexington,  MA).  Images  were  assembled  using  Canvas 
(ACD/Deneba)  and  adjusted  for  contrast.  Nuclear  spreads 
were  performed  as  in  Pankratz  and  Forsburg  (2005)  with 
the  following  modifications.  Rad22yfp  was  detected  using 
Abeam  290-50  rabbit  anti-GFP,  1:2000  in  5%  BSA  (Sigma), 
and  secondary  donkey  anti-rabbit  cy3  1:500  DNA  was  stained 
with  lx  Hoechst  (Invitrogen) .  The  objective  used  was  Leica 
63X/1.32,  PLApo,  NA  =  1.32.  Images  were  captured  and 
assembled  as  above. 

Damage  assays:  Cells  were  grown  to  A595  =  1 ,  serially  diluted 
fivefold,  and  spotted  onto  YES  plates  or  YES  plates  containing 
10  fxg/ml  thiabendazole  (TBZ),  5  mM  HU,  or  0.005,  0.007,  or 
0.01%  MMS,  and  incubated  for  3-5  days  at  25°,  29°,  or  32°,  as 
indicated.  For  UV  irradiation  assay,  1000  cells  were  plated  in 
duplicate  onto  YES  and  immediately  treated  with  the  indicated 
doses  of  UV  irradiation  using  a  Stratalinker  (Stratagene). 
Control  plates  were  left  untreated.  Plates  were  incubated  at  32° 
for  2-3  days.  Colonies  were  counted  and  the  percentage 
survival  was  obtained  relative  to  the  number  of  colonies  grown 
on  the  untreated  control  plates. 

^  Two-hybrid  strains:  The  mstl+  open  reading  frame  was 
isolated  by  PCR  from  plasmid  pEBG72  with  primers  HAT-3 
forward  (AGATCTCGCCAAAAACTTGCTCAATCTTCTTCC) 
and  HAT-3  reverse  (GGTACCCGACATAGACTGGCACATTA 
CATCC)  and  cloned  into  the  two-hybrid  vector  pDBLeu  cleaved 
with  Ncdl  and  NoK  to  create  plasmid  pEBG91.  Budding-yeast 
two-hybrid  strain  AH  109  (Cion tech)  with  HIS3,  ADE2,  and  lacZ 
reporters  downstream  of  heterologous  GAL4-responsive  pro¬ 
moter  elements  was  transformed  with  an  S.  pombe  Matchmaker 
cDNA  library  (Clontech)  and  screened  for  interactions  as 
described  in  Leverson  et  al.  (2002).  Plasmids  from  positive 
clones  were  isolated  and  verified  by  re  transformation,  and  the 
identity  of  the  interacting  open  reading  frame  was  determined 
by  sequence. 

RESULTS 

Amstl  cells  undergo  disordered  mitosis:  Previously, 
we  identified  two  members  of  the  MYST  family  of  his¬ 
tone  acetyltransferases  in  fission  yeast.  Our  analysis  indi¬ 
cated  that  Mstl  is  the  likely  ortholog  of  S.  cerevisiae  Es3.1, 
and  like  5cEsal,  is  a  chromatin-bound  protein  that  is 
essential  for  viability  (Gomez  et  al.  2005).  Data  from 


Figure  1. — Spore  germi¬ 
nation.  (A)  Wild-type  diploid 
strain  {ura4^ /ura4-D  18)  and 
(B)  heterozygous  disrup¬ 
tion  mutant  {mstl^ /lS.mstl :: 
ura4^)  were  sporulated  and 
processed  for  random  spore 
analysis.  Spores  were  inocu¬ 
lated  into  media  lacking  ura¬ 
cil  and  sampled  for  flow 
cytometry  to  monitor  entry 
into  S  phase  and  the  process 
of  DNA  replication.  Cell 
morphology  was  visualized 
by  staining  for  DNA  with 
DAPI. 

metazoans  suggest  that  the  Hbol  histone  acetyl  transfer¬ 
ase,  which  is  also  a  MYST  family  protein,  may  function  in 
DNA  replication  (Iizuka  and  Stillman  1999;  Burke 
et  al.  2001;  Iizuka  et  al.  2006).  To  investigate  whether 
S.  pombe  Mstl  contributes  to  DNA  replication,  we  ex¬ 
amined  the  phenotype  of  cells  containing  the  Amstl 
allele.  Mutant  diploids  heterozygous  for  Amstl  ::ura4^ 
and  wild-type  diploids  heterozygous  for  ura4^  were 
induced  to  sporulate.  We  inoculated  purified  spores 
from  both  diploids  into  medium  lacking  uracil  to  carry 
out  bulk  spore  germination.  Only  spores  containing  the 
ura4^  gene  are  able  to  germinate  in  these  conditions, 
and  their  DNA  content  was  determined  by  flow  cytom¬ 
etry.  Spores  containing  Amstl : : ura4^  synthesized  DNA 
to  an  approximately  2C  DNA  content,  with  timing  sim¬ 
ilar  to  that  of  wild  type  (Figure  1 ) .  Half  of  each  popula¬ 
tion  contains  nongerminating  ura4-D18  spores,  which 
persist  as  a  1C  population.  Upon  microscopic  examina¬ 
tion,  we  observed  that  many  of  the  l^mstl : : ura4-\-  cells 
showed  highly  disordered  mitosis,  including  a  high 
fraction  of  phenotypes.  This  suggests  defects  in  chro¬ 
mosome  segregation,  which  might  result  from  mitotic 
defects  or  from  replication  defects  and  checkpoint 
disruption. 

Isolation  and  characterization  of  a  temperature-sensi¬ 
tive  allele:  To  better  characterize  Mstl,  we  constructed  a 
temperature-sensitive  allele  based  on  temperature-sensi¬ 
tive  alleles  in  S.  cerevisiae  ESAl  (Clarke  et  al.  1999).  We 
tested  two  mutations:  L271P  (corresponding  to  L254P  in 
5cEsal)  and  L344S  (corresponding  to  L267S  in  5cEsal). 
Constructs  containing  these  were  placed  in  a  LEU2 
plasmid  under  control  of  the  nmtl  promoter  and  trans¬ 
formed  into  the  Amstl::ura4^  diploid.  Following  sporu- 
lation,  we  assessed  recovery  of  spores  at  25°  containing 
both  the  ura4^  marker  (marking  Amstl)  and  the  LEU2 
marker  (marking  the  plasmid) .  Both  mutants  were  able 
to  complement  the  disruption  at  25°.  However,  upon 
shift  to  36°,  only  the  mstl-L344S  strain  was  temperature 
sensitive. 

We  attempted  to  replace  the  endogenous  mstl^  allele 
with  the  L5445 allele,  without  success.  We  reasoned  that, 
when  integrated  under  the  native  promoter,  a  single 
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Figure  2. — Characterization  of  temperature-sensitive  allele.  (A-D)  Comparison  of  growth  rate  and  FACS  for  wild-type  (FYll) 
and  mstl-L344S  cells  (FY  2396)  grown  in  the  presence  of  thiamine  at  25°  or  36°.  (Top)  Colony  formation  on  plates,  where  colony 
size  is  proportional  to  growth  rate.  (Bottom)  FACS  analysis  following  shift  to  36°.  (E)  mstl-L344S  cells  at  25°  (left)  or  36°  at  6 
(middle)  and  8  (right)  hr  after  shift.  Cells  were  stained  with  DAPI  and  Calcofluor.  (F)  Fraction  of  cells  with  indicated  phenotypes 
during  a  temperature-shift  experiment. 


copy  of  mstl-L344S  might  be  insufficient  for  viability. 
Therefore,  we  integrated  mstl-L344S  under  control  of 
the  nmtl^  promoter  into  the  leul-32  locus  of  the  het¬ 
erozygous  diploid  and  sporulated  to  create  the  haploid 
strain  l^mstl::ura4^  leul^ :  :Pnmti-mstl-L344S.  Upon  shift 
to  36°  and  addition  of  thiamine  to  shut  off  the  nmtl 
promoter,  this  strain  was  unable  to  grow.  However,  cells 
were  able  to  grow  in  the  presence  of  thiamine  at  25°  or 
32°,  indicating  that  the  protein  is  functional  at  lower 
temperatures.  There  is  sufficient  background  expression 
of  the  nmtl^  promoter  even  the  presence  of  thiamine 
for  stable  proteins  to  be  maintained  (Figure  2,  A-D).  An 
equivalent  strain  with  a  marker  at  Amstl  behaved 

similarly  (data  not  shown) . 

We  examined  the  phenotype  of  cells  containing  mstl- 
L344S  by  shifting  asynchronous  cells  to  the  restrictive 
temperature.  The  increase  of  cell  number  was  blocked 
within  2  hr.  However,  cell  mass  measured  by  A595  con¬ 
tinued  to  increase  for  hr.  DNA  content,  measured  by 
flow  cytometry,  showed  no  change  from  the  2C  peak 
characteristic  of  exponentially  growing  wild-type  fission 
yeast,  although  later  time  points  showed  a  flattening  of 
the  profile  and  an  increase  of  cells  with  higher  DNA 
content.  This  is  common  in  heterogeneous,  elongated 
cells  and  is  typical  of  most  c<ic  mutants. 

We  examined  cells  for  nuclear  and  septum  morphol¬ 
ogy  by  staining  with  DAPI  and  Calcofluor,  respectively, 
and  observing  under  fluorescence  (Figure  2E,  quanti- 
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fied  in  Figure  2F).  Strikingly,  the  number  of  cells  517 
staining  with  Calcofluor  significantly  increased  in  the  518 
mstl-L344S  strain  at  restrictive  temperature,  with  nearly  519 
50%  of  cells  arrested  with  septa  by  6  hr.  Some  of  these  520 
were  mononucleate,  showing  asymmetric  nuclear  seg-  521 
regation.  Additionally,  there  was  an  increase  in  the  over-  522 
all  number  of  binucleate  cells  relative  to  wild  type.  523 

S.  pombe  has  an  unusual  cell  cycle  in  which  cytokinesis  is  524 

completed  during  the  subsequent  nuclear  cell  cycle.  525 
Thus,  cells  with  a  2C  DNA  content  by  FACS  may  be  526 
mononucleate  cells  in  G2  or  M  phase  or  binucleated  527 
cells  in  which  the  nuclei  are  in  Gi  or  S  phase  and  cyto-  528 
kinesis  is  not  complete.  Therefore,  arrest  as  a  binucleate  529 

septated  cell  may  represent  a  defect  entering  DNA  rep-  530 
lication  in  the  ensuing  cell  cycle.  Alternatively,  it  could  531 
represent  a  defect  in  cytokinesis  that  delays  entry  into  532 
the  next  cell  cycle.  533 

This  phenotype  is  different  from  the  disordered  534 
mitosis  seen  in  the  Amstl  mutant  spores.  Because  ger-  535 
minating  spores  arise  from  stationary  or  Gi  cells,  while  536 
exponentially  growing  S.  pombe  are  predominantly  G2  537 
with  a  2C  DNA  content,  we  reasoned  that  the  different  538 
phenotypes  could  represent  the  effect  of  inactivating  539 
Mstl  at  different  cell  cycle  stages.  Alternatively,  they  540 
could  suggest  that  the  mstl-L344S  allele  creates  a  hypo-  541 

morph  rather  than  a  complete  inactivation  of  the  gene.  542 

We  attempted  to  resolve  whether  the  mutation  affects  543 
S  phase  by  synchronizing  the  mstl-L344S  cells  in  Gi,  544 


6 


E.  B.  Gomez  et  al. 


545  TABLE  2 


Genetic  interactions  between  mstl-L344S  and  other  mutants 

547 


548 

549 

Mutation 

Protein  affected 

Double-mutant 

phenotype 

550 

551 

552 

553 

mcm2ts 

DNA  replication 
MCM  helicase 

Reduced  growth 

mcm4ts 

MCM  helicase 

temperature 
Reduced  growth 

554 

555 

mcm7ts 

MCM  helicase 

temperature 
Reduced  growth 

556 

557 

orpl  ts 

ORC  subunit 

temperature 
Reduced  growth 

558 

559 

560 

poll  ts 

DNA  polymerase  a 

temperature 
Reduced  growth 

hskl  ts 

Cdc7  kinase 

temperature 
Reduced  growth 

561 

562 

Arad2 

FEN  endonuclease 

temperature 
No  effect 

563 

564 

Arad32 

DSB  repair 

MREll 

No  effect 

565 

Arad50 

RAD50 

Reduced  growth 

566 

567 

Arhp51 

RAD51 

temperature 
Reduced  growth 

568 

569 

570 

Arqhl 

SGSl/Blm  helicase 

temperature 
No  effect 

Artil 

Rad52  homolog 

No  effect 

571 

Centromere/heterochromatin 

572 

Acbhl 

CENP-B  homolog 

No  effect 

573 

Aclr3 

Histone  deacetylase 

Reduced  growth 

574 

Aclr4 

Histone  methyltransferase 

temperature 
Reduced  growth 

575 

576 

Ahipl 

HIRA  homolog 

temperature 
No  effect 

577 

Amscl 

Putative  demethylase 

No  effect 

578 

Amst2 

MYST  acetyltransferase 

No  effect 

579 

Aswi6 

HPl  homolog 

Reduced  growth 

580 

581 

582 

Asm 

Other 

Arginine  methyltransferase 

temperature 

No  effect 

583 

Atazl 

Telomere-binding  protein 

No  effect 

584  Double  mutants  were  constructed  and  growth  rate  relative 

585  to  the  single  mutants  was  assessed  by  streaking  at  different 
temperatures. 

587 

588  using  nitrogen  starvation  (supplemental  Figure  1). 

589  When  released  from  nitrogen  starvation  into  complete 

590  medium,  mst-L344S  cells  completed  replication  with 

591  similar  timing  to  wild  type,  also  suggesting  that  in- 

592  activation  of  mstl  by  the  mstl-L344S  allele  or  by  the 

593  Amstl  allele  is  not  sufficient  for  cells  to  block  DNA 

594  replication  in  the  first  cell  cycle. 

595  mstl-L344S  genetically  interacts  with  replication, 

596  recombination,  and  heterochromatin  mutants:  We  ex- 

597  amined  the  phenotypes  of  double  mutants,  which 

598  contained  mstl-L344S  as  well  as  mutations  in  genes 

599  required  for  DNA  replication,  double-strand  break 

600  repair,  and  heterochromatin  assembly  (Table  2,  Figure 

601  3),  We  tested  the  double-mutant  strains  for  their  tem¬ 


perature  sensitivity.  We  defined  a  genetic  interaction  as  602 

occurring  when  a  double-mutant  strain  was  unable  to  603 

grow  at  temperatures  where  both  parents  were  growth  604 

proficient  (“reduced  growth  temperature”).  There  605 

were  modest  genetic  interactions  with  many  replication  606 

initiation  mutants.  Additionally,  we  observed  that  the  607 

maximum  growth  temperature  was  reduced  when  mstl  608 

was  combined  with  mutations  ^Thp31  (ScRadbl),  609 

^md50,  and  ^rad22,  which  affect  homologous  DNA  610 

recombination.  The  t^md22  mstl-L344S  strain  grew  611 

extremely  slowly  even  at  25°;  however,  because  of  the  612 

high  frequency  of  spontaneous  suppressors  in  ^rad22  613 

backgrounds  (Osman  et  at.  2005),  we  cannot  be  confi-  614 

dent  that  this  strain  is  suppressor  free.  615 

Because  mstl -L344S cells  show  defects  in  chromosome  616 

segregation  and  are  sensitive  to  thiabendazole  (see  617 

Figure  5 ) ,  we  examined  interactions  with  genes  required  6 1 8 

for  centromere  heterochromatin  assembly  (Table  2,  619 

chromatin).  We  observed  that  maximum-growth  tern-  ^  620 

perature  was  strikingly  lower  than  that  of  the  parents  621 

when  mstl-L344S  was  combined  with  ^swi6,  ^clr3,  and  622 

S.clr4,  all  of  which  define  genes  required  to  establish  and  623 

maintain  centromeric  heterochromatin  (reviewed  in  624 
Ekwall  2007).  However,  we  observed  no  defects  in  625 
centromere  or  telomere  silencing  in  mstl-L344S  cells  626 
(data  not  shown) .  This  suggests  that  the  heterochroma-  627 

tin  itself  is  intact.  We  examined  additional  chromatin-  628 

related  proteins.  No  effect  is  observed  when  mstl-L344S  629 
is  combined  with  mutation  of  histone  acetyltransferase  630 
Amst2  or  the  telomere-binding  protein  ^tazl.  631 

Identification  of  potential  Mstl  interaction  partners  632 
by  two-hybrid  screening:  SpMstl  interacts  with  the  633 

actin-related  protein  SpAlp5/ScArp4,  a  known  subunit  634 
of  the  NuA4  complex  (Minoda  et  al.  2005).  Because  635 

evidence  suggests  that  TipGO  may  function  indepen-  636 

dently  of  the  NuA4  complex  (see  Introduction),  we  637 

sought  to  identify  additional  proteins  that  interact  with  638 

Mstl,  using  a  two-hybrid  screen.  We  isolated  a  number  639 
of  interacting  gene  products,  summarized  in  Table  3.  640 

We  isolated  Rad22,  the  S.  pombe  ortholog  of  homol-  641 

ogous  DNA  recombination  protein  Rad52.  Rad22  642 

promotes  the  strand-invasion  step  of  homologous  re-  643 

combination.  Fission  yeast  has  an  additional  Rad22  644 

family  member,  called  Rtil .  We  constructed  plasmids  to  645 

test  whether  Mstl  interacted  in  the  two-hybrid  system  646 

with  Rtil  or  with  Rhp51.  Although  Rhp51  and  Rad22  647 

themselves  interact  in  two-hybrid  screens  {e.g.,  Catlett  3]  548 

and  Forsburg  2003),  we  observed  no  evidence  for  649 
interaction  between  Mstl  and  Rhp51  or  between  Mstl  650 

and  Rtil,  suggesting  that  this  interaction  is  specific  651 
(Figure  4A) .  Immunoprecipitation  of  Rad22-YFP  using  652 
a  cross-reacting  anti-GFP  antibody  precipitates  Mstl-V5  653 
(Figure  4B);  however,  immunoprecipitation  of  Mstl-V5  654 

does  not  precipitate  Rad22-YFP  (data  not  shown) .  655 

We  isolated  three  chromatin-related  proteins  as  po-  656 

tential  Mstl  partners.  The  first,  Hipl,  is  a  putative  histone  657 

chaperone  most  closely  related  to  ScHirl  and  human  658 
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Figure  3. — Genetic  interactions.  Double 
mutants  were  constructed  and  serial  dilu¬ 
tions  were  performed  at  different  tem¬ 
peratures  to  characterize  any  synthetic 
interactions.  Equivalent  numbers  of  expo¬ 
nentially  growing  cells  were  diluted  fivefold 
and  incubated  at  the  indicated  temperatures. 


HIRA  (Blackwell  et  al  2004).  The  second,  Cbhl,  is 
one  of  the  three  CenpB  homologs  in  fission  yeast 
associated  with  centromere  assembly  (Baum  and 
Clarke  2000;  Nakagawa  et  al  2002).  We  tested  all 
three  CenpB  homologs  for  interaction  with  Mstl.  Only 
Cbhl  showed  a  specific  interaction  (Figure  4).  We 
observed  no  co-immunoprecipitation  between  Cbhl 
and  Mstl  from  soluble  or  crosslinked  lysates  (data  not 
shown). The  third  gene  that  we  isolated  was  Mscl.  Mscl 
was  identified  as  a  high-copy  suppressor  of  the  check¬ 
point  mutation  AcMi;  it  has  E3  ligase  activity  and 
homology  to  a  family  of  demethylases,  and  mutation 
disrupts  histone  acetylation  (Ahmed  et  al.  2004;  Dul  and 
Walworth  2007).  We  constructed  double  mutants 
containing  mstl-L344SyA\h  ^hipl,  Acbhl,  or  Amscl  and 
tested  growth  at  different  temperatures.  There  was  no 
difference  in  the  growth  characteristics  of  the  double 
mutants  compared  to  mstl-L344S  cells  alone.  We  were 
unable  to  detect  Mscl  in  soluble  extracts  and  our 
crosslinking  protocol  was  unsuccessful  on  this  protein 
(data  not  shown). 

The  most  frequently  isolated  two-hybrid  clone  was 
skbl^ ,  which  encodes  an  arginine  methyl  transferase,  an 
^  ortholog  of  S.  cerevisiaeW^Yl  (Fujita  et  al  1999;  Shulewitz 
et  al  1999;  Lee  et  al  2000) .  We  observed  no  obvious  genetic 
interactions  between  Askbl  and  mstl-L344S,  in  contrast  to 


budding  yeast  where  hsl7  and  esal  mutants  have  been 
shown  to  have  synthetic  interactions  (Ruault  and  Pillus 
2006).  Two  putative  transcription  factors  were  also  iso¬ 
lated.  The  res2/pctl^  gene  has  been  studied  extensively 
and  it  is  required  for  Gi/S-specific  transcription  (Tanaka 
etal  1992;  Zhu  etal  1994).  taflll^  (also  known  as  tafl^)  is 
a  putative  TFllD  subunit  required  for  induction  of  genes 
during  nitrogen  starvation  and  sexual  development 
(Ueno  et  al  2001).  The  remaining  two-hybrid  isolates  in¬ 
cluded  bud6^,  which  is  also  associated  with  actin  function 
(Glynn  et  al  2001),  and  a  SEC  18  homolog  likely  involved 
in  vesicular  transport  (SPAC1834.11c).  The  only  pre¬ 
viously  described  fission  yeast  Mstl  interactor,  Alp5 
(Minoda  et  al  2005),  was  not  isolated  in  our  screen,  nor 
were  any  other  subunits  of  the  NuA4  complex. 

We  observed  very  little  Mstl  (or  Mst2)  protein  in  the 
soluble  fraction  (supplemental  Figure  2a).  This  may 
explain  why  our  attempts  to  confirm  the  two-hybrid 
interactions  using  co-immunoprecipitation  in  fission 
yeast  were  for  the  most  part  unsuccessful.  Upon  over¬ 
production  of  Mstl,  it  is  possible  to  observe  some 
interactions  with  other  proteins;  for  example,  we  found 
that  Mcm2  can  interact  with  Mstl  or  Mst2  but  only  when 
both  partners  are  overproduced  (supplemental  Figure 
2b) .  However,  we  observed  no  interactions  with  endog¬ 
enous  levels  of  MCM  proteins  (data  not  shown),  even 
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TABLE  3 


Genes  isolated  by  two-hybrid  screening 


P-Galactosidase 

Gene 

Function 

-f 

bud6/fatl^ 

Actin-interacting  protein  (ScBUD6/AIP3  homolog)  (Glynn  et  al.  2001) 

-f 

cbhl^ 

One  of  three  fission  yeast  CENP-B  proteins.  Required  for 
heterochromatin  function.  No  S.  cerevisiae  homolog 
(Baum  and  Clarke  2000;  Nakagawa  et  al.  2002). 

-f 

hipl^ 

HIRA  histone  chaperone  (ScHIRl  homolog)  (Blackwell  et  al.  2004) 

-f-f 

mscl^ 

High-copy  suppressor  of  ArM;  E3  ubiquitin  ligase  and 

putative  demethylase  (Ahmed  et  al.  2004;  Dul  and  Walworth  2007) . 
No  N  cerevisiae  homolog. 

-f-f 

rad22^ 

Homologous  recombination  protein  (ScRAD52  homolog) 

(Ostermann  et  al.  1993) 

-f 

res2/pctl^ 

Transcription  factor;  forms  a  complex  with  CdclO  (ScMBPl  homolog) 
(Miyamoto  et  al.  1994). 

-f 

seel  8^ 

Putative  vesicle  membrane  fusion  protein;  uncharacterized  5rSecl8 
homolog.  ORE:  SPAC 1834. lie. 

-f-f 

skbl^ 

Arginine  methyl  transferase  (ScHSL7  homolog)  (Gilbreth  et  al.  1996) 

-f-f 

taflir 

Transcription  factor;  putative  TFIID  subunit  required  for  sexual 
development  SPAC2G11.14  (Ueno  et  al.  2001). 

|3-Galactosidase  is  a  qualitative  measure  of  the  degree  of  activation  in  the  two-hybrid  system. 

though  MCMs  themselves  can  be  coprecipitated  under  centromere  or  kinetochore  function  are  often  TBZ 

those  conditions  {e.g.,  Sherman  et  al.  1998;  Gomez  et  al.  sensitive;  this  may  relate  to  the  synthetic  interactions 

2002).  We  completed  our  initial  characterization  of  that  we  observed  with  the  heterochromatin  mutants 

Mstl  protein  by  showing  that  acetylation  of  the  four  core  (Table  2)  and  to  the  physical  interactions  with  Cbhl  and 

histones  can  be  observed  in  an  IP-acetylation  assay  Mscl,  which  affect  centromere  function  (Table  3). 

(supplemental  Figure  3).  Mstl  is  required  for  recovery  from  HU  arrest:  HU 

mstl-L344S  cells  are  damage  sensitive:  Data  suggest  causes  nucleotide  depletion,  which  results  in  replica- 

that  the  NuA4  complex  in  other  species  is  recruited  to  tion  fork  stalling  early  in  S  phase.  Defects  in  HU 

double-strand  breaks  through  association  with  phos-  response  may  reflect  a  failure  to  arrest  the  cell  cycle 

phorylated  histone  H2AX  (Downs  et  al.  2004;  Robert  during  the  acute  treatment  or  an  ability  to  restore 

et  al.  2006) .  Our  data  showing  an  interaction  both  replication  and  recover  after  HU  is  removed  (reviewed 

genetically  and  physically  with  Rad22  suggested  that  in  Branzei  and  Foiani  2007).  As  shown  in  Figure  6,  A, 

Mstl  may  contribute  to  repair  of  DSBs  by  influencing  B,  and  D,  mstl-L344S  mutants  arrest  with  a  1C  DNA 

homologous  recombination.  Rad22  has  also  been  im-  content  and  modestly  elongated  morphology,  as  do 

plicated  in  recovery  from  HU  (Meister  et  al.  2005;  wild-type  and  Acdsl  cells,  indicating  that  checkpoint- 

Bailis  et  al.  2008).  Therefore,  we  examined  mstl-L344S  mediated  cell  cycle  arrest  is  intact.  As  seen  in  the  time 

mutants  for  evidence  of  a  role  in  the  damage  response.  course  in  Figure  6,  A  and  B,  wild-type  cells  complete  S 

We  first  examined  growth  of  mstl-L344S  in  response  phase  within  30  min  of  removing  HU,  while  Acdsl 

to  chronic  exposure  to  low  levels  of  DNA-damaging  mutants  are  severely  delayed,  consistent  with  their 

agents  (Figure  5A) .  We  examined  MMS  (alkylating  recovery  defect.  Wild-type  cells  also  proceed  to  division 

agent  delays  replication),  HU  (inhibits  ribonucleotide  and  enter  the  next  cell  cycle,  as  seen  in  the  increase  of 

reductase  and  causes  fork  stalling),  and  camptothecin  4C  population  (reflecting  binucleate  cells  that  com- 

(CPT;  blocks  topoisomerase  and  causes  DSBs).  mstl-  plete  S  phase  before  completing  the  previous  cytokine- 

L344S  cells  were  sensitive  to  HU  and  MMS,  but  not  to  sis).  This  population  is  not  seen  in  the  Acdsl  mutants, 

CPT  at  29°.  mstl-L344S  was  also  sensitive  to  UV  irradi-  which  are  defective  for  recovery  and  show  a  broad, 

ation  (200J/m^),  but  not  toy-irradiation  (400  Gy)  at  25°  heterogeneous  peak  of  DNA  content  at  the  later  time 
(notshown).  We  observed  that  sensitivity  to  these  agents  points.  The  mstl  cells  resemble  wild  type,  suggesting 

increased  with  temperature  (data  not  shown).  The  that  the  mutants  maintain  and  restart  the  replication 

L344S  mutant  was  extremely  sensitive  to  bleomycin  at  fork  normally.  This  suggests  that  the  sensitivity  that  we 

32°  (radio-mimetic;  Figure  5B) .  These  results  are  con-  observed  in  mstl  cells  in  HU  reflects  defects  in  a  later 

sistent  with  known  roles  for  Mstl  family  acetyltrans-  stage  of  recovery.  Similar  results  have  been  seen  for 

ferases  in  some  forms  of  DSB  repair.  We  also  observed  Arad22  and  ^rhp51  (Meister  et  al.  2005;  Bailis  et  al. 

substantial  sensitivity  to  thiabendazole,  a  microtubule-  2008).  Interestingly,  although  mstl-L344S  cells  were 

destabilizing  drug  (Figure  5C).  Mutations  affecting  sensitive  to  HU  during  chronic  treatment  (Figure  5), 


830 

831 

832 

833 

834 

835 

836 

837 

838 

839 

840 

841 

842 

843 

844 

845 

846 

847 

848 

849 

850 

851 

852 

853 

854 

855 

856 

857 

858 

859 

860 
861 
862 

863 

864 

865 

866 

867 

868 

869 

870 

871 

872 

873 

874 

875 

876 

877 

878 

879 

880 
881 
882 

883 

884 

885 

886 


Mstl  Required  for  Damage  Response 


9 


887 

888 

889 

890 

891 

892 

893 

894 

895 

896 

897 

898 

899 

900 

901 

902 

903 

904 

905 

906 

907 

908 

909 

910 

911 

912 

913 

914 

915 

916 

917 

918 

919 

920 

921 

922 

923 

924 

925 

926 

927 

928 

929 

930 

931 

932 

933 

934 

935 

936 

937 

938 

939 

940 

941 

942 

943 


B 

Mst1-V5 

Rad22-YFP 


Input 

-  +  -  + 

-  -  +  + 


IP 

-  -f  -  -f- 

-  -  +  + 


anti-GFP 

anti-V5 


Figure  4. — Two-hybrid  interactions.  (A)  A  prey  construct 
containing  Mstl  was  tested  for  interaction  with  full-length 
Rad22,  Rtil,  Rhp51,  Cbhl,  Cbh2,  and  Abpl  (see  materials 
AND  methods)  .  S.  cerevisiae cells  containing  the  indicated  plas¬ 
mids  were  tested  for  growth  on  SD  media  lacking  histidine 
and  adenine  (right),  compared  to  control  media  containing 
both  supplements.  Growth  on  the  selective  media  indicates  a 
successful  two-hybrid  interaction.  (B)  Physical  interaction  be¬ 
tween  Mstl-V5  and  Rad22-GFP.  One  milligram  of  protein  was 
immunoprecipitated  using  1  pi  of  anti-GFP  antibody.  Input 
lanes:  50  pg  of  total  protein  extract.  GFP  was  detected  by 
ECL+  incubation  for  30  min.  After  30  min,  V5  blots  were  ex¬ 
posed  to  film  after  ECL+  (Pierce)  incubation  for  30  sec. 

viability  was  relatively  unaffected  by  an  acute  treatment 
in  the  HU  block-and-release  protocol  (Figure  6C) .  This 
suggests  that  prolonged  exposure  to  HU  leads  to  an 
accumulation  of  growth  defects,  whereas  transient 
exposure  for  a  single  generation  causes  relatively  little 
problem. 


One  measure  of  DNA  damage  is  provided  by  re¬ 
cruitment  of  recombination  proteins  to  repair  centers 
(Caspari  et  al  2002;  Du  et  al  2003;  Meister  et  al  2005) . 
During  HU  treatment,  Rad22  focus  formation  can 
distinguish  two  functions.  In  wild-type  cells,  Rad22  is 
recruited  in  a  burst  during  the  recovery  phase  after  cells 
are  released  from  HU.  In  contrast,  in  checkpoint 
mutants  that  fail  to  respond  properly  to  HU,  collapsed 
replication  forks  cause  DNA  breaks  and  Rad22  is 
recruited  during  HU  treatment  (Caspari  et  al.  2002; 
Du  et  al  2003;  Meister  et  al  2005;  Bailis  et  al  2008) .  To 
determine  whether  reduced  Mstl  function  causes  DNA 
damage,  we  examined  Rad22YFP  focus  formation  in 
wild-type  and  mstl-L344S  cells  at  32°  (Figure  6,  E  and  F) . 
Due  to  background  fluorescence  observed  in  mstl- 
L344S  mutant  cells,  we  had  to  detect  Rad22-YFP  using 
indirect  immunofluorescence  on  nuclear  spreads.  We 
observed  a  modest  increase  in  Rad22  focus  formation  in 
mstl-L344S  cells  without  HU  treatment,  relative  to  wild- 
type  cells  (compare  0-hr  time  point  of  mstl  and  wild  type 
in  Figure  6F).  This  suggests  that  the  cells  suffer  some 
amount  of  endogenous  damage.  During  acute  HU 
treatment,  Acdsl  cells  show  a  dramatic  increase  in 
Rad22  YFP  focus  formation,  consistent  with  replication 
fork  collapse  and  double-strand  breaks  (Meister  et  al 
2005;  Bailis  et  al  2008).  There  is  no  change  in  the 
Rad22YFP  foci  in  mstl-L344S,  again  consistent  with  the 
model  that  replication  forks  are  intact. 


DISCUSSION 

The  MYST  family  of  histone  acetyltransferases  is 
associated  with  diverse  functions  (reviewed  in  Utley 
and  Cote  2003).  Previously,  we  showed  that  S.  pombe 
mstl^  is  an  essential  gene  that  encodes  a  protein 
constitutively  bound  to  the  chromatin  (Gomez  et  al 
2005).  In  this  report,  we  have  characterized  the  pheno¬ 
types  associated  with  deletion  of  mstl^  and  a  tempera¬ 
ture-sensitive  allele. 

S.  pombe  Mstl  is  the  ortholog  of  S.  cerevisiae  Esal  and 
human  TipGO,  the  catalytic  subunit  of  the  NuA4  family 
of  histone  acetyltransferases,  which  is  now  referred  to  as 


0.01%  MMS  5mMHU 


2  uM  CPT 


Figure  5. — Damage  sensitivity 
of  mstl-L344S.  (A)  mstl  sensitivity 
relative  to  DNA-damaging  agents. 
Exponentially  growing  cells  were 
diluted  fivefold  on  YES  plates  with 
the  indicated  drugs  and  grown  4 
days  at  29°.  (B)  mstl  is  sensitive 
to  bleomycin.  Exponentially 
growing  cells  were  diluted  fivefold 
on  YES  plates  and  grown  3  days  at 
32°.  (C)  mstl  is  sensitive  to  the 
spindle  poison  TBZ.  Exponen¬ 
tially  growing  cells  were  diluted 
fivefold  on  YES  plates  and  grown 
3  days  at  32°. 
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Figure  6. — mstl-L344S  has  HU  recovery  defect.  (A  and  B)  Time  course  of  DNA  content  following  release  from  HU  at  32°  (A) 
and  36°  (B)  for  cells  in  YES.  Strains:  wild  type  (FYll),  mstl-L344S  (FY2396),  Acdsl  (FY865).  (C)  Viability  of  cells  following  treat¬ 
ment  with  12  mM  HU  at  32°.  Circles,  wild  type;  triangles,  Acdsl;  squares,  mstl.  Open  symbols,  no  HU;  solid  symbols,  12  mM  HU. 
(D)  Morphology  of  cells  in  C  released  before  (—HU),  during  (4  hr  HU),  and  after  (2.5  hr  rel).  Cells  were  fixed  in  ethanol  and 
stained  with  DAPI.  Bar,  10  fxm.  (E)  Rad22YFP  foci  in  wild-type,  Acdsl,  or  mstl  cells  after  2  hr  in  HU.  Rad22YFP  was  detected  by 
immunofluorescence  with  anti-CEP  antibody,  which  recognizes  the  YEP  variant.  (F)  Fraction  of  cells  with  Rad22YFP  foci  at  0  or 
2  hr  of  incubation  in  HU.  -,  negative  control  with  no  Rad22YFP.  Strains:  wild  type  (FY261) ,  wild  type  with  Rad22YFP  (FY3436) ,  mstl 
RadllYFP  (FY3369),  Acdsl  Rad22YFP  (FY3286). 


the  KAT5  family  (reviewed  in  Squatrito  et  al.  2006; 
Allis  et  al.  2007) .  In  addition  to  its  presumed  effects  in 
transcriptional  regulation  (Smith  et  al.  1998;  Allard 
et  al.  1999;  Clarke  et  al.  1999,  2006;  Durant  and  Pugh 
2006) ,  this  family  is  implicated  in  checkpoint  control  by 


direct  acetylation  of  ATM  and  p53  (Sun  et  al.  2005;  Jiang 
et  al.  2006;  Sykes  et  al.  2006;  Tang  et  al.  2006),  as  well  as 
in  repair  by  direct  recruitment  to  phosphorylated 
histone  H2AX  at  double-strand  breaks  (Bird  et  al. 
2002;  Downs  et  al.  2004;  Tamburini  and  Tyler  2005; 
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TABLE  4 

Summary  of  processes  associated  with  mstl  +  function 
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Associated 

genes  Function 

mstlts  genetic 
interactions 

Mstl  physical 
interactions 

rhp51+ 

Process:  DNA  double-strand  break  repair 

Phenotypes:  mstlts  mutant  sensitivity  to  DNA-damaging  agents  MMS,  bleomycin,  and  CPT 
Homologous  recombination  Yes 

None  (Y2H) 

rad22+ 

Homologous  recombination 

Yes 

Yes  (Y2H,  IP) 

mcm2+ 

Process:  replication  fork  stability/ recovery 
Evidence:  mstlts  mutant  is  sensitive  to  chronic  HU  exposure. 

Rad22  foci  even  under  permissive  conditions 
MCM  helicase,  required  for  replication  fork  assembly 

Increased 

Yes  (all) 

Mcm2  (coIP) 

mcm4+ 

mcm7+ 

hskl+ 

and  elongation,  and  recovery  from  HU 

MCM  kinase,  required  for  replication 

Yes 

ND 

poll+ 

initiation  and  HU  recovery 

DNA  polymerase  a 

Yes 

ND 

Process:  centromere  function 

Evidence:  Amstl,  mstlts  mutants  have  chromosome  segregation  defects,  mstlts  mutant  is  TBZ  sensitive. 

dr 3+ 

No  silencing  defect  suggests  that  the  heterochromatin  is  intact  in  mstlts. 
Pericentromere  heterochromatin  assembly,  silencing.  Yes  (all) 

ND 

dr4+ 

swi6+ 

cbhl+ 

centromere  cohesion,  assembly  of  Cnpl  histone  in  central  core 

Pericentromere  heterochromatin  assembly,  assembly  of  Cnpl 

None 

Yes  (Y2H) 

mscl+ 

histone  in  central  core 

Assembly  of  Cnpl  histone  in  central  core,  kinetochore  assembly 

None 

Yes  (Y2H) 

hirl  + 

Pericentromere  heterochromatin  silencing,  kinetochore  assembly 

None 

Yes  (Y2H) 

J8]  IP,  co-immunoprecipitation.  Y2H,  yeast  two-hybrid  interaction.  ND,  not  done. 


Robert  et  al.  2006) .  Data  also  suggest  that  this  family  of 
HAT  proteins  acetylates  histone  H2AZ  (Babiarz  et  al. 
2006;  Keogh  et  al.  2006;  Millar  et  al.  2006) ;  this  histone 
variant  is  paradoxically  associated  with  both  regions  of 
active  gene  transcription,  as  well  as  with  centromeres 
and  silent  heterochromatin  (reviewed  in  Guillemette 
and  Gaudreau  2006) . 

We  characterized  two  mutations  affecting  mstl^  func¬ 
tion.  As  we  reported  previously,  the  Amstl  mutation 
causes  lethality  (Gomez  et  al.  2005).  In  this  report,  we 
show  that  spores  bearing  the  disruption  are  able  to 
germinate  and  complete  DNA  replication.  However, 
they  show  a  highly  disordered  mitosis.  We  also  con¬ 
structed  the  temperature-sensitive  mutant  mstl-L344S, 
which  contains  the  same  lesion  as  a  temperature- 
sensitive  budding-yeast  mutant.  At  the  restrictive  tem¬ 
perature,  this  strain  is  also  proficient  for  replication, 
whether  it  is  shifted  asynchronously  (Figure  2),  released 
from  nitrogen  starvation  in  Gi  (supplemental  Figure  1 ) , 
or  released  from  hydroxyurea  in  S  phase  (Figure  6) .  We 
observed  pleiotropic  phenotypes,  including  disordered 
mitosis,  irregular  segregation,  and  a  high  fraction  of  cell 
cycle  arrest  with  septa.  Cells  arrest  within  one  to  two  cell 
cycles  with  this  mixed  phenotype. 

Our  genetic  analysis  and  subsequent  two-hybrid 
screen  suggest  that  Mstl  impacts  three  pathways  (Table 
4):  damage  response,  which  is  consistent  with  the 
physiological  analysis  of  the  mstl-L344S  mutant  and 


data  from  other  systems;  centromere  function;  and 
control  of  mitotic  entry. 

Even  at  permissive  temperatures,  mstl-L344S  cells  are 
sensitive  to  a  variety  of  agents  that  disrupt  the  cell  cycle, 
including  bleomycin  and  MMS,  which  cause  DNA 
damage  and  double-strand  breaks;  hydroxyurea,  which 
causes  replication  fork  arrest;  and  thiabendazole,  which 
causes  spindle  depolarization.  The  damage  sensitivity 
was  expected  because  data  from  numerous  sources  link 
members  of  this  family  to  the  double-strand  break 
response.  The  recruitment  of  KAT5  orthologs  to  dou¬ 
ble-strand  breaks  is  thought  to  modulate  the  break 
response  and  perhaps  downregulate  the  signal  associ¬ 
ated  with  H2AX  phosphorylation  by  recruiting  the 
remodelers  INO80  and  SWRl  and  the  alternative 
histone  H2AZ  (Morrison  et  al.  2004;  Tsukuda  et  al. 
2005;  VAN  Attikum  and  Gasser  2005b;  Papamichos- 
Chronakis  et  al.  2006). 

When  mstl-L344S  is  combined  with  the  mutations 
^Thp51  (Rad51),  ^rad50,  or  S.rad22,  cells  have  a  severe 
synthetic  growth  phenotype  in  which  they  grow  more 
poorly  than  either  parent.  This  is  particularly  apparent 
in  I^rad22  mstl-L344S;  however,  because  Arad22  alone  is 
prone  to  accumulate  suppressors  in  the  phl^  gene 
(Osman  et  al.  2005),  we  cannot  be  sure  that  our  double 
mutant  is  not  also  carrying  a  suppressor.  It  is  possible 
that  ^rad22  mstl-L344Swill  be  synthetically  lethal  but  we 
have  not  been  able  to  test  this  convincingly.  These 
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1229  genetic  data  suggest  that  mstl-L344S  requires  a  func- 

1230  tional  recombination  pathway  for  viability.  To  assess 

1231  whether  mstl-L344S  causes  damage  itself,  we  examined 

1232  focus  formation  of  Rad22YFP  in  mstl-L344S  cells.  There 

1233  is  an  increase  in  foci  in  mstl  compared  to  wild  type  at 

1234  32°,  even  in  the  absence  of  any  damaging  treatment 

1235  (Figure  6F).  This  suggests  that  attenuation  of  Mstl 

1236  function  results  in  some  form  of  damage  that  requires 

1237  Rad22  for  repair,  but  also  suggests  that  Rad22-YFP  can 

1238  be  recruited  to  sites  of  damage  in  the  absence  of  Mstl. 

1239  We  observed  a  physical  association  between  Mstl  and 

1240  Rad22  in  two-hybrid  analysis  and  in  co-immunoprecip- 

1241  itation.  This  interaction  is  specific  to  Rad22,  as  neither 

1242  the  Rad22  homolog  Rtil  nor  the  Rad51  homolog  Rhp51 

1243  associate  with  Mstl  in  two-hybrid  tests  (Figure  4).  This 

1 244  could  reflect  a  role  in  recruitment.  Another  possibility  is 

1245  that  Mstl  acetylates  Rad22  itself,  as  TipGO  does  to  ATM 

1246  or  p53,  to  modulate  its  behavior,  and  we  will  investigate 

1247  this  in  the  future. 

1248  Unexpectedly,  mstl-L344S  cells  are  also  sensitive  to 

1249  hydroxyurea,  which  causes  replication  fork  arrest.  The 

1250  mutant  cells  arrest  properly  with  a  1C  content  without 

1251  Rad22-YFP  foci  and  appear  to  complete  S  phase  follow- 

1252  ing  release,  indicating  that  the  checkpoint  functions 

1253  and  the  replication  fork  are  intact;  instead,  they  have 

1254  modestly  reduced  viability  reminiscent  of  Arhp31  or 

1255  Arad22  mutants  (Meister  et  al  2005;  Bailis  et  al  2008) . 

1256  We  observe  that  HU  sensitivity  of  mstl-L344S  to  HU 

1257  differs  if  cells  are  exposed  to  low  levels  over  a  prolonged 

1258  [6]  time,  which  causes  cell  death  (chronic  exposure.  Figure 

1259  5),  or  whether  they  are  exposed  to  higher  levels  over  a 

1260  single  generation  (acute  exposure.  Figure  6).  Similar 

1261  results  have  been  observed  for  the  fork  protection 

1262  mutant  Aswil  (ScTofl),  which  survives  acute  exposure 

1263  but  dies  during  chronic  exposure  (Sommariva  et  al 

1264  2005) .  It  is  also  possible  that  this  HU  sensitivity  is  related 

1265  to  the  interaction  between  mstl  and  mutations  in 

1266  recombination  genes;  however,  Rad22-YFP  recruitment 

1267  appears  largely  normal  in  mstl  cells  treated  with  HU. 

1268  A  second  broad  functional  area  defined  by  our  two- 

1269  hybrid  and  genetic  interactions  is  centromere  assembly. 

1270  Fission  yeast  centromeres  are  large  and  complex  ele- 

1271  ments,  with  a  central  core  surrounded  by  flanking 

1272  repeats  that  form  peri-centromeric  heterochromatin 

1273  (reviewed  in  Ekwall  2007;  Grewal  and  Jia  2007). 

1274  Mutations  disrupting  the  heterochromatin,  the  central 

1275  core,  or  the  kinetochore  typically  show  sensitivity  to 

1276  spindle  poisons  such  as  TBZ,  and  a  high  fraction  of 

1277  chromosome  loss  and  disordered  mitoses,  as  we  ob- 

1278  served  (Figures  1,  2,  and  5).  Our  genetic  analysis 

1279  suggests  that  mutations  that  disrupt  normal  pericentro- 

1280  meric  heterochromatin  function  such  as  Ac/r5  (histone 

1281  deacetylase),  Ac/r4  (methyltransferase),  or  ^swi6  (het- 

1282  erochromatin  protein  1)  reduce  the  growth  tempera- 

1283  ture  of  mstl-L344S  quite  substantially.  The  simplest 

1284  interpretation  of  this  epistasis  experiment  is  that  mstl^ 

1285  affects  a  separate  pathway  from  Clr4  and  Swi6  in  cen¬ 


tromere  function,  leading  to  a  synthetic  effect  in  the 
double  mutant.  We  see  no  loss  of  silencing  at  centro¬ 
meres  or  telomeres  in  a  mstl  mutant,  which  suggests  that 
Swi6-formed  heterochromatin  is  largely  intact,  and 
some  other  aspect  of  centromere  assembly  and  function 
is  disrupted.  A  good  candidate  is  the  central  core.  Three 
of  the  proteins  that  we  identified  in  the  two-hybrid  tests 
are  linked  to  central  core  and  kinetochore  function: 

1.  We  isolated  Mscl,  which  was  first  identified  as  a  high- 
copy  suppressor  of  a  strain  lacking  the  checkpoint 
kinase  Chkl  (Ahmed  et  al  2004) .  A  recent  study  shows 
that  overexpression  of  Mscl  rescues  mutations  of 
cnpl  (the  specialized  H3-Cen/ CenpA  of  the  centro¬ 
mere  central  core)  in  addition  to  S.chkl  and  that  this 
suppression  requires  the  H2AZ  histone  variant  en¬ 
coded  by  phtl  +  (Ahmed  et  al  2007) .  H2AZ  is  linked 
to  heterochromatin  assembly  and  centromere  func¬ 
tion  (reviewed  in  Guillemette  and  Gaudreau 
2006)  and  H2AZ  is  a  known  substrate  of  KAT5  family 
HATs  (Babiarz  et  al  2006;  Keogh  et  al  2006;  Millar 
et  al  2006;  Auger  et  al  2008) . 

2.  We  isolated  Cbhl,  which  is  one  of  the  three  CenpB 
homologs  in  fission  yeast  (Baum  and  Clarke  2000; 
Nakagawa  et  al  2002).  The  CenpB  homologs  are 
proposed  to  affect  centromere  structure  (Nakagawa 
et  al  2002)  and,  in  metazoans,  CenpB  is  involved  in 
recruitment  of  H3-Cen/CenpA  (reviewed  in  Mel- 
lone  and  Allshire  2003) . 

3.  We  isolated  Hipl,  the  HIRA/Hirl  homolog.  HIR 
proteins  are  histone  chaperones  required  for  assem¬ 
bly  of  chromatin  outside  of  S  phase  (reviewed  in 
Gunjan  et  al  2005) .  In  fission  yeast,  deletion  of  Hipl 
causes  chromosome  loss,  TBZ  sensitivity,  and  disrup¬ 
tion  of  centromere  silencing  (Blackwell  et  al 
2004). 

HIR  proteins  in  S.  cerevisiae  are  implicated  in  normal 
kinetochore  function  (Sharp  et  al  2002).  In  contrast  to 
the  heterochromatin  mutants,  we  see  no  synthetic 
growth  phenotypes  when  mstl  is  combined  with  mutants 
l^cbhl,  ^mscl,  or  l^hipl.  This  would  be  consistent  with 
mstl^  acting  in  a  pathway  with  these  genes.  Together 
with  the  segregation  defects  of  mstl-L344S,  these  two 
hybrid  interactions  suggest  that  Mstl  may  affect  the 
centromere  central  core.  We  are  examining  mitotic 
phenotypes  of  mstl  in  more  detail  to  determine  how  this 
may  occur  and  the  role  of  H2AZ  in  this  process. 

There  is  indirect  evidence  that  Mstl  may  affect 
mitotic  entry.  We  isolated  the  Skbl  methyltransferase 
in  our  two-hybrid  screen.  Budding  yeast  Hsl7,  the 
ortholog  of  Skbl,  was  identified  because  mutations  of 
hsl7  are  synthetically  lethal  with  histone  tail  mutants 
(Ma  et  al  1996);  genetic  interactions  between  ^cl^hsl7 
and  ScEsal  suggest  that  histone  tail  modifications  are 
important  for  normal  mitosis  when  Swel  kinase  is 
activated  (Ruault  and  Pillus  2006).  While  we  ob¬ 
served  no  genetic  interaction  between  SpA^A:^!  and  our 
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temperature-sensitive  allele  of  mstl,  the  physical  in¬ 
teraction  that  we  observed,  together  with  the  genetic 
data  from  budding  yeast,  suggests  that  there  is  a  role  for 
histone  tail  modification  in  the  proper  regulation  of 
mitotic  entry  in  both  species.  Another  link  to  CDK 
activation  is  provided  by  our  isolation  of  Hipl  as  a 
putative  interactor.  The  ^hipl  mutation  is  synthetically 
lethality  with  cdc23-22,  which  is  a  temperature-sensitive 
mutation  in  the  mitotic  inducer  Cdc25  that  opposes 
Weel  (ScSwel)  (Blackwell  et  al.  2004;  reviewed  in 
Karls son-Rosenthal  and  Millar  2006). 

We  did  not  isolate  any  of  the  NuA4  complex  proteins 
in  our  two-hybrid  screen,  although  the  previously 
characterized  SpAlp5/ScArp4  is  known  to  interact  with 
Mstl  by  co-immunoprecipitation  (Minoda  et  al.  2005). 

0  This  is  not  surprising  as  the  two-hybrid  screen  identifies 
binary  associations,  and  many  of  the  interactions  in  the 
large  NuA4  complex  are  likely  to  be  indirect  (Auger 
et  al.  2008).  We  speculate  that  the  structure  of  the  two- 
hybrid  bait  construct,  in  which  the  amino-terminus  of 
Mstl  is  fused  to  the  DNA-binding  domain  of  Gal4,  may 
interfere  with  the  assembly  of  NuA4  subunits  with  Mstl. 
Conversely,  we  did  not  observe  co-immunoprecipitation 
of  epitope-tagged  Mstl  with  most  of  the  two-hybrid 
clones.  This  could  reflect  chromatin  association  of  the 
proteins  involved,  temporally  or  spatially  limiting  asso¬ 
ciation,  or  could  indicate  that  the  C-terminal  epitope 
tag  on  Mstl  is  not  available  when  the  protein  is  in  a 
complex  with  other  factors.  A  more  detailed  structure- 
function  analysis  and  localization  studies  are  in  progress 
to  resolve  these  observations. 

The  interactions  that  we  observed,  in  addition  to  the 
phenotypes  associated  with  the  mstl  mutants,  suggest 
that  Mstl  affects  a  variety  of  cell  functions.  Importantly, 
the  recent  discovery  that  Tip60  can  acetylate  nonhis¬ 
tone  proteins  in  the  absence  of  the  NuA4  complex  {e.g.. 
Sun  et  al.  2005;  Sykes  et  al.  2006;  Tang  et  al.  2006;  Kim 
et  al.  2007)  indicates  that  the  identification  of  Mstl  as  a 
histone  acetyltransferase  may  be  too  limiting  a  defini¬ 
tion.  We  speculate  that  the  pleiotropic  effects  of  mstl 
mutants  indicate  a  variety  of  functions  that  may  be 
directly  modulated  by  Mstl  catalytic  activity,  not  neces¬ 
sarily  linked  to  histone  acetylation.  Our  study  suggests 
that  Mstl  may  have  functions  in  replication  fork  stabil¬ 
ity  and  in  centromere  assembly,  and  experiments  are 
underway  to  determine  whether  these  effects  are  medi¬ 
ated  by  histone  modifications  or  by  other  aspects  of  Mstl 
function. 
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The  APOBEC-2  crystal  structure  and  functional 
Implications  for  the  deaminase  AID 

Courtney  Prochnow'* *,  Ronda  Bransteitter'*,  Michael  G.  Klein',  Myron  F.  Goodman'  &  Xiaojiang  S.  Chen' 


APOBEC-2  (AP02)  belongs  to  the  family  of  apolipoprotein  B  mes¬ 
senger  RNA-editing  enzyme  catalytic  (APOBEC)  polypeptides, 
which  deaminates  mRNA  and  single-stranded  DNA^’^.  Different 
APOBEC  members  use  the  same  deamination  activity  to  achieve 
diverse  human  biological  functions.  Deamination  by  an  APOBEC 
protein  called  activation-induced  cytidine  deaminase  (AID)  is  crit¬ 
ical  for  generating  high-affinity  antibodies^,  and  deamination  by 
APOBEC-3  proteins  can  inhibit  retrotransposons  and  the  replica¬ 
tion  of  retroviruses  such  as  human  immunodeficiency  virus  and 
hepatitis  B  virus^“^.  Here  we  report  the  crystal  structure  of  AP02. 
AP02  forms  a  rod-shaped  tetramer  that  differs  markedly  from  the 
square-shaped  tetramer  of  the  free  nucleotide  cytidine  deaminase, 
with  which  APOBEC  proteins  share  considerable  sequence  homo¬ 


logy.  In  AP02,  two  long  a-helices  of  a  monomer  structure  prevent 
the  formation  of  a  square-shaped  tetramer  and  facilitate  forma¬ 
tion  of  the  rod-shaped  tetramer  via  head-to-head  interactions  of 
two  AP02  dimers.  Extensive  sequence  homology  among  APOBEC 
family  members  allows  us  to  test  AP02  structure-based  predic¬ 
tions  using  AID.  We  show  that  AID  deamination  activity  is 
impaired  by  mutations  predicted  to  interfere  with  oligomerization 
and  substrate  access.  The  structure  suggests  how  mutations  in 
patients  with  hyper-IgM-2  syndrome  inactivate  AID,  resulting  in 
defective  antibody  maturation. 

We  crystallized  AP02,  which  contains  amino  acid  residues 
41-224,  with  four  monomers  in  each  asymmetric  unit  that  form  a 
tetramer  with  an  atypical  elongated  shape  (Fig.  la).  This  tetramer 


(31 '-hairpin 


Figure  1  |  The  structure  of  AP02.  a,  The  AP02  tetramer  structure.  It  has  an 
end-to-end  span  of  —  126.9  A.  Zn  atoms  in  the  active  centres  are  shown  as  red 
spheres,  b,  The  square-shaped  structure  of  human  cytidine  deaminase  (PDB 
accession  number:  IMQO),  a  fntCDA.  c,  d,  The  AP02  monomer  structure 
rotated  by  90  degrees,  showing  the  unique  features  of  AP02:  the  short  (31' 
strand  and  helices  h4  and  h6.  h4  and  h6  dictate  how  AP02  oligomerizes. 


e.  The  AP02  dimer  formed  by  two  monomers  (in  purple  and  yellow).  Each 
has  a  different  conformation  for  the  hl/(3l-turn  (in  red):  a  loop  (LI)  and  a 
hairpin,  f.  The  tetrameric  interface,  showing  the  extensive  interactions 
mediated  through  h4,  h6  and  LI.  g.  The  stick  model  (hydrophobic,  polar  and 
charged  amino  acids  in  h4,  h6  and  LI)  of  the  interactions  at  the  tetramer 
interface. 
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assembles  through  two  different  monomer-monomer  interfaces,  in 
contrast  to  the  canonical  square  shape  of  the  free  nucleotide  cytidine 
deaminase  (fntCDA)  tetramer  (Fig.  lb),  in  which  all  four  monomers 
interact  with  each  other^.  The  elongated  AP02  tetramer  has  the  shape 
of  a  butterfly  (Fig.  la)  with  an  end-to-end  span  of  approximately 
126.9  A. 

The  AP02  monomer  appears  to  adopt  the  typical  core  fold  of  the 
fntCDAs  with  a  five-stranded  P-sheet  fianked  by  helices  on  both  sides 
(Figs  Ic,  d).  However,  one  new  attribute  is  the  additional  ot-helices 
surrounding  the  core  P-sheet  (Figs  Ic,  d);  six  long  helices  are  present 
in  the  AP02  monomer  whereas  only  three  or  four  are  observed  in  the 
fntCDA  monomer  (excluding  the  shorter  3io  helices) Helices  h3 
and  h6  make  extensive  contacts  with  h4,  stabilizing  the  position  of  the 
helices  within  the  monomer  subunit.  On  the  basis  of  the  close 
sequence  homology  of  AP02  with  other  APOBEC  proteins,  the  long 
helix  (h4)  probably  serves  as  a  structural  signature  of  this  family 
(Figs  Ic,  d). 

The  AP02  dimer  is  formed  by  pairing  two  long  P-strands  (P2) 
(Fig.  le),  joining  two  P-sheets  sideways  to  form  one  wide  P-sheet  that 
resembles  a  ribcage  (Fig.  la,  e).  Twelve  residues  (residues  82-93)  on 
each  P2  strand  form  12  hydrogen  bonds  through  main-chain  atoms, 
providing  the  principal  bonding  force  between  the  two  monomers. 
The  dimer  interface  is  reinforced  by  the  side-chain  interactions 
occurring  through  the  loops  and  helices  located  on  both  sides  of 
the  P-sheet.  Ordered  water  molecules  also  help  to  stabilize  this 
interface. 


The  dimer  is  nearly  symmetrical  (Fig.  le)  with  six  helices  (h2,  h3 
and  h4  of  both  molecules)  located  on  one  side  of  the  augmented 
P-sheet  and  four  helices  (hi  and  h5  of  both  monomers)  on  the  other 
side.  Capped  on  both  edges  of  the  P-sheet  are  h4  and  h6.  However, 
one  part  of  the  dimer  shows  obvious  asymmetry  at  the  turn  between 
hi  and  strand  pi  (hl/pi-turn).  This  hl/pl-turn  (residues  57-68) 
assumes  a  hairpin  structure  (Pl '-hairpin)  in  one  monomer,  and  a 
loop  conformation  (LI)  in  the  other  monomer  (Fig.  le,  f). 

The  AP02  tetramer  is  formed  by  two  dimers  joining  through 
head-to-head  interactions.  The  two  dimers  make  extensive  contacts 
via  the  residues  from  h4  and  h6,  as  well  as  the  loop  LI  at  the  hl/pi- 
turn  (Lig.  If).  Residues  Y61,  L155,  M156,  W157,  P160,  Y214  and 
Y215  from  each  side  of  the  interface  form  extensive  hydrophobic 
packing  interactions,  and  residues  R57,  S62,  S63,  R153,  E158,  E159 
and  E161  establish  salt  bridges  and  hydrogen  bonds  (Lig.  Ig).  Some 
charged  residues  even  use  their  aliphatic  side  chains  to  interact  with 
hydrophobic  residues.  Thus,  hydrophobic,  polar  and  charged  amino 
acid  side  chains  are  all  involved  in  the  tetramerization  interactions. 
The  total  buried  area  is  1,745  A^  within  the  tetramer  interface,  where 
h4  and  h6  play  a  major  role  forming  the  interface  (Lig.  If).  h4  and  h6 
also  sterically  hinder  the  formation  of  the  square-shaped  fntCDA- 
type  tetramer  by  occupying  the  space  where  another  monomer  would 
need  to  be.  Therefore,  h4  and  h6  appear  to  determine  directly  the 
elongated  tetramer  formation. 

A  prominent  feature  of  the  AP02  tetramer  distinctive  from  the 
fntCDA  tetramer  is  that  the  active  sites  are  accessible  for  large  RNA  or 


Figure  2  |  The  AP02  active  site,  a.  The  AP02  active  sites  are  accessible  to 
DNA/RNA.  Red  spheres  represent  Zn.  b.  The  fntCDA  active  site  is  accessible 
only  to  free  nucleotides,  c.  The  outer  AP02  active  sites  show  Zn 
coordination  (yellow  dashed  lines)  by  three  residues  (H98,  C128,  C131)  and 
a  water  molecule  (blue  sphere),  d.  The  middle  AP02  active  centre  sites  show 
Zn  coordination  by  a  fourth  residue,  E60.  e.  In  the  (31' -hairpin  structure,  the 


hydrophobic  ring  of  Y61  interacts  with  the  guanidine  group  of  R65, 
stabilizing  the  conformation,  f.  In  the  hl/(3l  loop,  the  E60  coordinates  with 
Zn.  Y61  now  rotates  away  from  R65  and  interacts  with  R57,  facilitating  the 
disruption  of  the  (3  T -hairpin  and  stabilizing  the  loop  conformation, 
g.  Superimposed  monomers  show  that  the  hl/(3l  loop  (purple)  is  pulled 
down  —8.5  A  towards  the  active  site  owing  to  the  E60-Zn  bond  formation. 
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DNA  substrates  (Fig.  2a).  In  the  square-shaped  fntCDA  tetramer, 
loops  from  two  neighbouring  monomers  cover  the  active  sites  so 
that  only  small  free  nucleotides  can  bind  to  the  buried  sites 
(Fig.  2b).  Although  the  yeast  fntCDA,  CDDl,  has  been  reported  to 
deaminate  the  apolipoprotein  B  mRNA  in  vitrOy  its  known  biological 
substrate  in  vivo  is  a  free  nucleotide  and  the  CDDl  structure  is  a 
canonical  square-shaped  tetramer^. 

In  fntCDAs,  the  active  centre  Zn  atom  is  coordinated  by  three 
residues  (either  three  cysteines,  or  two  cysteines  +  one  histidine) 
and  forms  a  fourth  bond  with  a  water  molecule  with  a  bond  distance 
of  —3.0  A  (ref.  10).  This  type  of  Zn  coordination  is  also  present  in 
AP02  (Fig.  2c),  but  only  in  the  two  outer  monomers  of  the  tetramer. 
Surprisingly,  the  active  sites  for  the  other  two  monomers  in  the 
middle  of  a  tetramer  contain  an  E60  residue,  which  replaces  the  water 
molecule  and  makes  the  fourth  coordination  bond  with  the  Zn 
(Fig.  2d).  This  coordination  of  Zn  by  four  residues  is  unexpected 
given  that  all  known  fntCDA  structures  have  only  three  amino  acid 
residues  participating  in  Zn  coordination^"^^. 

A  closer  examination  of  the  structure  reveals  a  ‘built-in’  mech¬ 
anism  for  a  conformational  switch  between  the  two  types  of  Zn 
coordination.  The  switch  is  mediated  by  sequences  contained  in 
the  hl/pi-turn,  in  which  E60  is  located.  The  hl/pi-turn  can  adopt 
either  a  hairpin  ( P 1 ' -hairpin,  Eig.  2e)  or  a  loop  (LI)  conformation 
(Eig.  2f),  which  controls  whether  or  not  E60  coordinates  with  Zn.  The 
E60  is  located  6  A  from  the  Zn  when  the  hl/pi-turn  is  a  pi '-hairpin 
(Eig.  2e).  The  pi '-hairpin  is  stabilized  by  main-chain  hydrogen 
bonds  within  the  pi  '-hairpin  and  reinforced  by  interactions  between 
Y61  and  the  guanidine  group  of  R65.  In  the  two  middle  monomers  of 
an  AP02  tetramer,  the  hl/pi-turn  folds  into  a  loop  (LI,  Eig.  If).  In 
this  conformation,  Y61  rotates  its  side  chain  to  interact  with  R57 
instead  of  the  R65  (Eig.  2f,  g).  The  new  pairing  of  Y61  with  R57 
destabilizes  the  pi '-hairpin  while  stabilizing  the  loop.  In  the  loop 
conformation,  the  E60  is  2.2  A  from  the  Zn  (Eig.  2f). 


The  hairpin-loop  switch  may  have  two  important  consequences. 
Eirst,  switching  to  the  loop  and  forming  the  fourth  Zn  coordination 
by  E60  prevents  coordination  by  water  and  subsequent  hydroxyla- 
tion  of  Zn  necessary  for  deamination  (Eig.  2d,  f).  Second,  Zn  coor¬ 
dination  by  E60  pulls  the  hl/pi-turn  approximately  8.5  A  towards 
the  active  centre  (Eig.  2g),  which  could  restrict  substrate  access  to  the 
active  centre.  On  the  other  hand,  breaking  of  the  fourth  coordination 
of  E60  may  allow  the  loop  to  move  away  from  the  active  centre  to 
form  the  pi '-hairpin  as  observed  in  the  outer  monomers.  The  E60 
would  no  longer  prevent  the  Zn  hydroxylation  and  nucleic  acid  sub¬ 
strate  access  to  those  active  sites.  Thus,  the  hairpin-loop  switch  can 
be  a  molecular  mechanism  for  regulating  substrate  access  and 
enzyme  activity  mediated  through  Zn  coordination. 

The  AP02  fragment  in  the  structure  shares  a  33.3%  amino  acid 
identity  (44.6%  homology)  with  AID,  and  the  buried  residues  in 
AP02  share  a  75%  identity  (96%  homology)  with  AID  (Eig.  3a). 
The  highly  conserved  residues  buried  inside  the  structure  and 
those  located  at  the  dimer/tetramer  interfaces  strongly  suggest  a 
structural  conservation  of  AID  with  AP02.  Thus,  the  AP02  crystal 
structure  should  provide  functional  insights  for  AID  and  other 
APOBEC  family  members,  despite  of  the  lack  of  the  known  biological 
activity  of  AP02.  Eor  this  reason,  we  use  AID  as  a  surrogate  to  test 
how  mutations  guided  by  AP02  structure  affect  AID  deamination 
activity. 

We  generated  glutathione  S- transferase  (GST) -AID  mutants  with 
amino  acid  substitutions  located  at  the  tetrameric  interface  (Eig.  3b), 
and  showed  that  the  mutants  either  had  no  detectable  or  signifi¬ 
cantly  reduced  deaminase  activity  compared  to  wild-type  GST- 
AID  (Eig.  3d-f).  Mutants  R112C  and  Y114A/E115A  were  inactive 
(Eig.  3e,  f),  while  mutants  K16A  and  Cl  16A  had  a  3. 3 -fold  reduction 
in  activity  (Eigs  3e,  f). 

AID  mutations  within  the  predicted  dimerization  domain,  E46A/ 
Y48A  (Eig.  3c),  displayed  a  fourfold  decrease  in  deamination  activity 

Figure  3  |  Structurally  guided  mutagenesis  of 
AID  impairs  deamination  activity,  a,  A  sequence 
alignment  of  AP02  and  AID,  showing  significant 
homology.  Red,  identical  residues;  grey  shading, 
buried  residues;  red  squares,  active  centre 
residues;  green  dots,  tetrameric  interface 
residues;  blue  diamonds,  dimeric  interface 
residues;  purple  stars,  HIGM  mutated  residues; 
and  black  triangles,  mutated  AID  residues, 
b.  Mutated  AID  residues  (in  green)  at  the 
tetramer  interface  as  modelled  based  on  the 
AP02  structure,  c.  Mutated  AID  residues  (in 
green)  in  the  dimer  interface  as  modelled  based 
on  the  AP02  structure,  d,  A  sketch  describing  the 
cytidine  deamination  assay.  F,  fluorescein;  UDG, 
is  uracil  DNA  glycosylase.  e.  Denaturing  PAGE 
analysis  of  the  deamination  activity  for  wild-type 
and  mutant  AID  proteins.  The  30-nucleotide  (nt) 
band  indicates  deamination  activity,  f.  Bar 
representation  of  the  specific  activities  for  wild- 
type  and  mutant  AID  proteins. 
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Figure  4  |  AID  HIGI\/l-2  mutations,  a,  Alignment  of  mutated  residues  of  AID 
from  HIGM-2  patients  with  the  corresponding  residues  in  AP02,  showing 
high  sequence  conservation,  b,  Mapping  the  residues  in  AID  HIGM-2 
mutations  (RII2,  LI  13,  NI68)  to  the  tetramer  interface  as  modelled  from 
the  AP02  structure,  c,  Mapping  the  AID  HIGM-2  mutations,  S83  and  S85, 
near  the  active  site,  d.  Mapping  the  AID  mutations,  KI6,  YI  14/FI  15  and 


Cl  16  (in  green),  to  the  exposed  surface  of  an  outer  monomer.  The  HIGM-2 
AID  residues  (RII2,  LI  13,  N168,  in  yellow),  which  are  at  the  tetramer 
interface  (b),  are  also  located  on  this  exposed  surface,  e.  Mapping  of  AID 
HIGM-2  mutations,  W80,  LI 06,  MI 39  and  FI 5 1,  to  the  interior  core 
structure. 


(Fig.  3e,  f).  The  dimer  interface  is  extensive,  so  two  mutations  should 
not  completely  disrupt  dimeric  AID.  This  explains  why  weak  deami¬ 
nation  activity  was  observed  with  this  double  AID  mutant.  These 
mutational  results  suggest  that  the  residues  within  the  predicted 
dimeric  and  tetrameric  interfaces  are  important  for  deamination 
activity.  One  caveat  is  that  the  residues  on  the  tetramer  interface 
are  also  present  on  the  exposed  surface  of  the  outer  ends  of  the 
tetramer  and  thus  could  also  be  involved  in  an  additional  role  beside 
tetramerization  (Figs  3b,  4d).  We  noticed  that  in  gel  filtration  assays 
the  tetramer  was  a  minor  species  when  compared  with  the  dimer, 
suggesting  a  stronger  dimeric  interaction. 

AID  has  an  arginine  (R19)  at  the  equivalent  position  of  the  AP02 
E60  residue  (Fig  3a)  that  may  have  a  negative  regulatory  role  for 
AP02  activity  by  blocking  Zn  hydroxylation  and  substrate  access 
(Fig.  2f).  We  showed  that  an  AID  R19E  mutant  mimicking  AP02 
E60  had  a  significantly  decreased  deamination  activity  (about  4.6- 
fold  less  than  the  wild  type.  Figs  3e,  f).  Similarly,  the  AID  R24  residue 
is  equivalent  to  AP02  R65,  which  interacts  with  Y61  of  AP02  to 
stabilize  the  open  pi '-hairpin  conformation.  We  predicted  that  the 
disruption  of  the  R65-Y61  interaction  would  collapse  the  pi  '-hairpin 
into  the  closed  loop  conformation  to  block  substrate  access  and 
impair  deamination  activity.  Indeed,  the  AID  R24E  mutant  was  com¬ 
pletely  inactive  on  single-stranded  DNA  (Fig.  3e,  f). 

Mutations  in  human  AID  cause  hyper-IgM-2  (HIGM-2)  syn¬ 
drome,  characterized  by  an  impaired  production  of  high-affinity 
antibodies^^’^^.  The  mutated  AID  residues  of  HIGM-2  patients  are 
highly  conserved  in  AP02  (Fig.  4a).  A  plausible  explanation  for  why 
and  how  HIGM-2  mutations  disrupt  AID  function  is  given  by  the 
structure  of  AP02  (Fig.  4b-e).  On  the  basis  of  the  crystal  structure, 
HIGM-2  AID  mutations  can  be  divided  into  four  classes.  The  first 
mutant  class  (Alll,  R112,  LI  13  and  N168)  occurs  at  the  tetramer¬ 


ization  interface  (Fig.  4b).  The  second  mutant  class  includes  residues 
in  and  near  the  active  centre  (Fig.  4c),  H56,  E58,  S83,  S85  and  C87, 
which  are  conserved  among  all  APOBEG  enzymes.  The  AID  R24 
residue  is  also  mutated  in  HIGM-2  patients.  As  previously  discussed, 
R24  may  stabilize  the  P 1 '  -hairpin,  which  keeps  the  active  site  open 
for  DNA/RNA  access.  The  third  class  consists  of  residues  located  on 
the  enzyme  surface  (Fig.  4d),  including  those  residues  located  at  the 
tetramer  interface  (Alll,  R112,  LI  13,  N168;  see  Fig.  4d).  A  fourth 
class  of  HIGM-2  AID  mutations  are  those  with  large  hydrophobic 
side  chains  buried  within  the  core  (Fig.  4e),  including  W80,  LI 06, 
Ml 39  and  F151.  Three  of  these  residues  are  located  near  the  active 
centre.  Mutating  these  residues  should  disrupt  the  folding  and 
stability  of  AID. 

Since  many  of  the  APOBEG  enzymes  are  reported  to  form  dimers 
and  multimers,  the  AP02  structure  may  shed  light  on  how  these 
enzymes  oligomerize The  elucidation  of  the  AP02  structure, 
fortified  by  the  structure-guided  predictions  for  the  activity  of  spe¬ 
cific  AID  mutants,  provides  a  structural  basis  to  pursue  further  func¬ 
tional  studies  of  APOBEG  proteins  with  an  eye  towards  developing 
therapeutic  strategies  to  deal  with  deficiency  in  deaminating  cytidine 
and  to  restrict  retroviral  replication. 

METHODS 

Crystallography  statistics  can  be  found  in  the  Supplementary  Information. 
Protein  purification  and  crystallization.  Human  AP02  containing  residues  41- 
224  was  cloned  and  expressed  in  Escherichia  coli  as  a  recombinant  GST  fusion 
protein.  Following  GST  cleavage  by  thrombin,  further  purification  of  AP02  was 
achieved  using  Superdex-75  gel  filtration  chromatography.  Native  and  sel¬ 
enium-methionine  labelled  protein  was  concentrated  to  15  mgmP^  in  a  buffer 
containing  25  mM  Hepes,  pH  7.0,  50  mM  NaCl  and  10  mM  dithiothreitol. 
Crystals  were  grown  at  18  °C  by  hanging- drop  vapour  diffusion  from  a  reservoir 
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solution  of  85  mM  Na-citrate,  pH  5.6,  160  mM  LiS04,  24%  (weight/volume) 
polyethylene  glycol  monomethyl  ether  and  15%  glycerol. 

Structure  determination  and  refinement.  Native  and  selenium- multiwave¬ 
length  anomalous  diffraction  (MAD)  data  were  collected  at  the  synchrotron 
and  processed  using  HKL2000^^  (Supplementary  Table  1).  An  initial  solution 
was  obtained  at  3.5  A  using  the  program  Solve^^  and  located  eight  selenium 
atoms  using  the  peak  wavelength  data  set.  A  search  with  SHELXD^^  found  four 
additional  selenium  atoms  (totalling  twelve)  and,  subsequently,  the  program 
SHARP  identified  four  additional  weaker  anomalous  scattering  atoms,  which 
were  recognized  as  Zn  atoms.  Density  modification  schemes  of  solvent  flattening 
and  fourfold  non- crystallographic  symmetry  (NCS)  averaging  were  applied 
using  the  program  RESOLVE^^.  Additionally,  phase  extension  in  RESOEVE 
was  performed  with  the  native  data  set  to  2.5  A  resolution  using  the  two-wave- 
length  MAD  phases  calculated  in  SHARP.  The  molecular  model  was  built  based 
on  this  experimental  map  using  the  program  “O”  and  was  refined  with  the 
Crystallography  and  NMR  System  (CNS).  A  twofold  NCS  constrain  was  applied 
during  the  initial  simulated  annealing,  but  the  final  refinement  was  carried  out 
without  NCS  constrain.  The  protein  geometry  is  excellent  when  examined  using 
the  program  PROCHECK. 

Construction  of  AID  mutants.  Mutant  AID  proteins  were  contructed  by  site- 
directed  mutagenesis  using  the  pGEX-KG-AID  vector  as  the  PCR  template  and 
primers  specific  for  the  respective  mutations  (5'  -CTG  AGG  ATC  TTC  ACC  GCG 
TGC  CTC  TAG  TTC  TGT  GAG  GAC-3'  (R1 12C),  5'-ATC  TTC  ACC  GCG  CGC 
CTG  GCG  GCG  TGT  GAG  GAG  CGC  AAG  GCT-3'  (Y1 14/Yl  15),  5' -GCG  CGC 
CTC  TAG  TTC  TGT  GCG  GCC  CGC  AAG  GCT  GAG  CCC  GAG-3'  (El  17/ 
E118A),  5' -AAG  TTT  CTT  TAG  CAA  TTC  GCA  AAT  GTC  CGC  TGG  GCT 
AAG-3'  (K16A),  5'-ACC  GCG  CGC  CTC  TAG  TTC  GCT  GAG  GAG  CGC  AAG 
GCT  GAG-3'  (Cl  16A),  5 '-TAG  CAA  TTC  AAA  AAT  GTC  GAG  TGG  GCT  AAG 
GGT  CGG  CGT-3'  (R19E),  and  5'-ACA  TGC  TTT  TCA  CTG  GAG  GCT  GGT 
GCT  CTT  CGC  AAT  AAG  AAG  GGC-3'  (E46A/Y48A).  Mutant  constructs  were 
verified  by  DNA  sequencing. 

Deamination  reactions.  Deamination  experiments  were  performed  as  previously 
described^^  with  the  exception  that  the  substrate  used  was  a  fluorescein- dT 
incorporated  single-stranded  DNA  substrate  (5'-taa  agg  fluorescein-dTga 
aga  gag  gag  aga  gaa  gta  agC  tga  aga  gag  aga  agg  aag  aga  gtg  aag  gag-3'). 
Reaction  products  were  visualized  on  a  BioRad  EX  scanner. 
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Activation-induced  cytidine  deaminase  (AID)  initiates 
somatic  hypermutation  and  class  switch  recombination  in  B 
cells  by  deaminating  C  — >  U  on  transcribed  DNA.  Here  we  ana¬ 
lyze  the  role  of  phosphorylation  and  phosphorylation-null 
mutants  on  the  biochemical  behavior  of  AID,  including  enzyme 
specific  activity,  processivity,  deamination  spectra,  deamination 
motif  specificity,  and  transcription-dependent  deamination  in 
the  presence  and  absence  of  RPA.  We  show  that  a  small  fraction 
of  recombinant  human  AID  expressed  in  SJ9  insect  cells  is  phos- 
phorylated  at  previously  identified  residues  Ser^^  and  Thr^^  and 
also  at  Ser^^  and  Ser^^.  S43P  AID  has  been  identified  in  a  patient 
with  hyper-IgM  immunodeficiency  syndrome.  Ser-substituted 
phosphorylation- null  mutants  (S38A,  S41A,  S43A,  and  S43P) 
exhibit  wild  type  (WT)  activity  on  single-stranded  DNA.  Deami¬ 
nation  of  transcribed  double-stranded  DNA  is  similar  for  WT 
and  mutant  AID  and  occurs  with  or  without  RPA.  Although  WT 
and  AID  mutants  catalyze  processive  deamination  favoring 
canonical  WRC  hot  spot  motifs  (where  W  represents  A/T  and  R 
is  A/G),  their  deamination  spectra  differ  significantly.  The  dif¬ 
ferences  between  the  WT  and  AID  mutants  appear  to  be  caused 
by  the  replacement  of  Ser  as  opposed  to  an  absence  of  phospho¬ 
rylation.  The  spectral  differences  reflect  a  marked  change  in 
deamination  efficiencies  in  two  motifs,  GGC  and  AGC,  which 
are  preferred  by  mutant  AID  but  disfavored  by  WT  AID.  Both 
motifs  occur  with  exceptionally  high  frequency  in  human  switch 
regions,  suggesting  a  possible  relationship  between  AID  deami¬ 
nation  specificity  and  a  loss  of  antibody  diversification. 


AID,  a  B-cell  specific  protein,  is  required  for  tightly  regulated 
mechanisms  of  Ig  antibody  diversification,  somatic  hypermu- 
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tation,  class  switch  recombination,  and  gene  conversion.  SHM^ 
is  characterized  by  an  exceptionally  high  mutation  rate  —10“^ 
to  10“^  per  base  pair  per  cell  division  within  the  V(D)J  rear¬ 
ranged  Ig  genes  (1).  CSR  is  a  unique  region  specific  event  in 
which  the  donor  switch  region  upstream  of  the  IgH  exon 
(coding  for  an  IgM  antibody)  recombines  with  one  of  the  switch 
regions  that  is  5'  to  each  of  the  downstream  C  exons,  such  as  Cy 
C^,  or  C^,  producing  IgG,  IgA,  or  IgE  antibody  isotypes,  respec¬ 
tively  (2). 

Typically,  under  tight  transcriptional  control,  AID  is  induced 
in  activated  B  cells  within  germinal  centers,  causing  deamina¬ 
tion  of  cytosine  residues  in  the  variable  and  switch  regions  (V- 
and  S-regions,  respectively)  of  transcribed  Ig  loci.  Deaminated 
DNA  is  subsequently  replicated  or  repaired  by  different  cellular 
repair  mechanisms  to  give  rise  to  diversified  isotype-switched 
and  antigen-specific  high  affinity  antibodies  (for  reviews,  see 
Refs.  3-5). 

Biochemical  studies  have  shown  that  purified  native  B-cell 
AID  or  recombinant  AID  protein,  expressed  in  SJ9  insect  cells 
or  Escherichia  coli,  deaminates  C  residues  on  ssDNA  but  not 
dsDNA,  single-stranded  RNA,  or  DNA/RNA  hybrid  molecules 
(6-9).  AID  acts  processively  on  ssDNA  and  has  a  distinctive 
deamination  specificity,  favoring  C  targets  in  WRC  hot  spot 
motifs  (where  W  represents  A  or  T,  and  R  is  purine)  while 
avoiding  SYC  cold  spots  (where  S  represents  C  or  G  and  Y  is 
pyrimidine)  (10,  11).  These  intrinsic  biochemical  properties  of 
AID  are  likely  to  be  significant  determinants  of  SHM  hallmark 
features  in  B-cells,  favoring  mutations  at  WRC  hot  spot  motifs 
and  broad  clonal  heterogeneity  of  mutations  within  V-regions 
(12-14). 

Transcription  is  required  for  both  SHM  and  CSR  to  occur 
(15-17).  Since  AID  does  not  appear  to  act  on  dsDNA,  tran¬ 
scription  probably  plays  an  essential  role  in  generating  tran¬ 
sient  ssDNA,  its  principal  substrate.  The  ability  of  AID  to 
deaminate  C  residues  on  dsDNA  undergoing  active  transcrip¬ 
tion  has  been  demonstrated  in  cells  and  cell-free  assays.  For 
example,  ectopically  expressed  AID  can  induce  mutations  on 
highly  transcribed  target  gene  in  fibroblasts  (18)  and  E,  coli  (9, 


^The  abbreviations  used  are:  SHM,  somatic  hypermutation;  AID,  activation- 
induced  cytidine  deaminase;  WT,  wild  type;  CSR,  class  switch  recombina¬ 
tion;  ssDNA,  single-stranded  DNA;  dsDNA,  double-stranded  DNA;  HIGM-2, 
hyper-IgM  syndrome;  Ml,  mutability  index;  nt,  nucleotide(s). 
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19),  and  purified  AID  expressed  in  various  sources  has  been 
shown  to  act  on  dC  targets  on  linear  dsDNA  and  closed  circular 
dsDNA  model  substrates  when  transcribed  by  a  prokaryotic 
RNA  polymerase  (7, 10, 11, 20).  AID  has  been  shown  to  interact 
with  human  RNA  polymerase  II  (21),  although  there  are  cur¬ 
rently  no  biochemical  data  for  AID  acting  in  conjunction  with 
the  human  transcription  machinery  to  deaminate  dsDNA. 

Native  AID  isolated  from  stimulated  primary  B-cell  nuclei 
is  phosphorylated  on  Ser^^  and  Tyr^^^  residues  (22,  23), 
where  Ser^^  phosphorylation  occurs  on  about  5-15%  of  total 
AID  protein  (23).  Phosphorylation  does  not  appear  to  be 
B-cell-specific,  since  a  similar  or  greater  level  of  Ser^^-phos- 
phorylated  AID  was  observed  when  expressed  in  293T  and 
3T3-NTZ  cells  (23).  Protein  kinase  A  (PKA)  was  found  to 
interact  with  AID  (22,  24)  and  appears  to  be  a  primary  kinase 
responsible  for  phosphorylation  at  Ser^^  and  possibly  at 
Thr^^  residues  in  vivo  and  in  vitro  (22,  24). 

The  potential  role  of  phosphorylation  of  AID  in  SHM  and 
CSR  was  deduced  primarily  from  studies  of  AID  mutations  at 
phosphorylated  residues  Thr^^,  Ser^^,  and  Tyr^^^  A  phospho¬ 
rylation-defective  mutant,  Y184A,  did  not  affect  CSR  (22).  In 
contrast,  a  mutation  at  S38A  reduced  CSR  dramatically  to  no 
more  than  20%  of  the  wild  type  (22, 24)  or  moderately  (35-  80%) 
(23,  25)  in  ex  vivo  stimulated  B-cells.  AID  S38A  caused  a  signif¬ 
icant  decrease  of  SHM  in  B-cells  (23)  and  SHM  and  gene  con¬ 
version  in  chicken  DT40  cells  (26).  Phosphorylated  Ser^^  was 
reported  to  be  required  for  interaction  with  the  32-kDa  subunit 
of  RPA  (replication  protein  A)  (27).  AID  S38A  did  not  interact 
with  RPA;  nor  did  it  catalyze  deamination  of  C  on  dsDNA  in  a 
model  T7  RNA  polymerase  transcription  system  (22,  27). 

There  are  data,  however,  indicating  a  lesser  role  of  Ser^^ 
phosphorylation  in  regulating  AID  activity.  Zebrafish  AID, 
which  lacks  a  serine  at  a  position  corresponding  to  human  or 
mouse  Ser^^,  is  fully  active  in  supporting  SHM  and  CSR  (28, 
29)  and  gene  conversion  (26).  Recombinant  AID  expressed 
in  Sfi  cells  and  E,  coli  was  found  to  be  active  on  transcribed 
dsDNA  substrates  in  the  absence  of  RPA  (9-11,  30,  31). 
Thus,  although  it  is  clear  that  a  minority  of  AID  molecules 
are  phosphorylated  at  Ser^^,  it  is  considerably  less  clear  how 
the  biochemical  behavior  of  AID  is  influenced  by  either  the 
location  of  phosphorylated  residues  or  whether  or  not  a  par¬ 
ticular  residue  is  phosphorylated. 

The  roles  of  phosphorylation  on  the  properties  of  AID  in 
vitro  are  amenable  to  a  comparative  biochemical  analysis 
using  wild  type  (WT)  AID  and  phosphorylation-defective 
mutants.  Here  we  have  investigated  how  replacing  individual 
“phosphorylation-active”  Ser  residues  influences  AID  spe¬ 
cific  activity,  processivity,  deamination  specificity,  trans¬ 
criptional-dependent  deamination,  including  its  interac¬ 
tions  with  RPA,  and  mutational  spectra.  One  such  mutant, 
S43P,  has  been  identified  in  humans  diagnosed  with  hyper- 
IgM  (HIGM-2)  syndrome  (32),  characterized  by  the  absence 
of  CSR.  A  comparison  of  deamination  motif  specificities 
favored  by  S43P  in  relation  to  WT  AID  suggests  a  possible 
connection  of  AID  deamination  specificity  with  human 
immunodeficiency  disease. 


EXPERIMENTAL  PROCEDURES 

Enzymes  and  Substrates— Mutdint  AID  proteins  (S38A, 
S38D,  S41A,  S41D,  S43A,  and  S43P)  were  constructed  by  site- 
directed  mutagenesis  (QuikChange  site-directed  mutagenesis 
kit,  Stratagene)  using  the  pAcG2T-AID  vector  (6,  10)  as  the 
template.  Recombinant  baculoviruses  encoding  WT  and 
mutant  AID  were  generated  according  to  the  recommended 
protocol  (BD  Bioscience).  WT  and  mutant  GST- AID  proteins 
were  expressed  and  purified  as  described  previously  (10,  11), 
with  an  additional  of  Halt  phosphatase  inhibitor  mixture 
(Pierce)  in  the  lysis  buffer.  AID  proteins  were  dialyzed  in  a 
buffer  containing  20  mM  Tris-HCl  (pH  7.5),  25  mM  NaCl,  1  mM 
dithiothreitol,  1  mM  EDTA,  and  10%  glycerol  and  stored  at 
—  80  °C.  E.  coli  single-stranded  binding  protein  and  recombi¬ 
nant  human  RPA  were  overexpressed  in  E.  coli  and  purified 
according  to  published  protocols  (33,  34).  T7  RNA  polymerase 
was  purchased  from  Promega,  and  ultrapure  NTP  was  pur¬ 
chased  from  Amersham  Biosciences.  The  active,  recombinant 
catalytic  subunit  of  human  protein  kinase  A  was  purchased 
from  Calbiochem.  M13mp2  gapped  DNA  and  M13mp2T7 
covalently  closed  circular  dsDNA  substrates  were  prepared  as 
described  (10, 11). 

Mass  Spectrometry  Approximately  20  pg  of  puri¬ 

fied  AID  were  reduced  with  5  mM  dithiothreitol  in  the  presence 
of  1%  SDS  and  subsequently  alkylated  with  20  mM  iodoacet- 
amide.  Protein  was  then  precipitated  by  the  addition  of  3  vol¬ 
umes  of  a  solution  containing  50%  acetone,  49.9%  ethanol,  and 
0.1%  acetic  acid;  the  sample  was  kept  on  ice  for  15  min  and  then 
centrifuged  for  10  min  at  maximum  speed  on  a  bench  top  cen¬ 
trifuge.  The  pellet  containing  precipitated  AID  protein  was 
resuspended  in  40  jutl  of  8  m  urea,  100  mM  Tris-HCl,  pH  8.0.  The 
suspension  was  then  diluted  by  adding  160  pi  of  150  mM  NaCl, 
50  mM  Tris-HCl,  pH  8.0,  and  the  protein  was  digested  with  2  pg 
of  trypsin  (Promega)  overnight.  After  digestion,  0.4%  trifluoro- 
acetic  acid  was  added,  and  the  peptides  were  desalted  using  a 
50-mg  C18-Sep-Pak  cartridge  (Waters).  Phosphopeptides  were 
purified  by  IMAC  as  previously  described  (35).  Purified  phos¬ 
phopeptides  were  analyzed  by  microcapillary-liquid  chroma¬ 
tography-electrospray  ionization-tandem  mass  spectrometry 
on  a  Thermo  Finnigan  LTQ  quadrupole  ion  trap  mass  spec¬ 
trometer,  as  described  (35).  For  data  analysis,  the  SFQUFST 
(version  3.4  beta  2)  program  running  on  a  Sorcerer  system 
(SageN,  San  Jose,  CA)  was  used  for  peptide  identification.  A 
data  base  search  was  performed  using  a  sub-data  base  consist¬ 
ing  of  50  budding  yeast  proteins  in  addition  to  the  human  AID 
sequence.  The  following  variable  modifications  were  consid¬ 
ered:  +80  Da  (phosphorylation)  for  serine,  threonine  and  tyro¬ 
sine  residues;  +16  Da  (oxidation)  for  methionine  residues.  Up 
to  four  variable  modifications  were  allowed  per  peptide,  and  the 
peptide  mass  tolerance  used  was  3  Da.  A  semitryptic  restriction 
was  applied,  and  only  the  top-matched  peptides  with  a  proba¬ 
bility  score  above  0.9  were  subsequently  considered  for  close 
inspection.  Each  MS/MS  spectrum  that  led  to  a  phosphopep- 
tide  identification  was  manually  verified  to  confirm  that  all  sig¬ 
nificant  ions  were  accounted  for  and  then  validated. 

Measurements  of  Deamination-specific  Activity  on  ssDNA— 
Specific  activities  of  WT  AID  and  phosphorylation  mutants  were 
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measured  using  ^^P-labeled  Sb-nt  ssDNA  S'-AGAAA- 
AGGGGAAAGCAAAGAGGAAAGGTGAGGAGGT-3'. 

Reactions  were  carried  out  in  a  buffer  containing  50  mM 
HEPES  (pH  7.5),  1  mM  dithiothreitol,  10  mM  MgCl2  in  the  pres¬ 
ence  of  500  fmol  of  the  substrate  DNA,  200  fmol  of  recombi¬ 
nant  GST- AID  (.^-expressed)  or  6400  fmol  ofE.  coli  expressed 
GST-AID,  and  20  ng  of  RNase  A.  Eollowing  incubation  at  37  °C 
for  5  min,  the  reactions  were  quenched  by  a  double  extraction 
with  phenol/chloroform/isoamyl  alcohol  (25:24:1),  and  the 
deamination  product  was  analyzed  as  described  previously  (6). 
Specific  activities  were  calculated  as  amount  (fmol)  of  deami- 
nated  substrate/min/jULg  of  enzyme. 

Deamination  Activity  ofWT  and  Mutant  AID  on  Transcribed 
dsDNA — Transcription-dependent  AID  deamination  on  the 
nontranscribed  strand  of  dsDNA  was  measured  using  a  linear 
dsDNA,  undergoing  active  transcription  by  T7  RNA  polymer¬ 
ase  (10).  In  a  typical  reaction  (50  jutl),  GST-AID  (7  pmol  of  Sfi- 
expressed  WT  or  mutant  AID  or  50  pmol  of  E,  co//-expressed 
AID),  RNase  A  (200  ng),  dsDNA  substrate  (1  pmol),  and  T7 
RNA  polymerase  (1  jutl)  in  a  reaction  buffer  containing  HEPES 
(50  mM,  pH  7.5),  dithiothreitol  (5  mM),  MgCl2  (10  mM),  and 
NTPs  (250  ijlm)  were  incubated  at  37  °C  for  30  min.  E.  coli  SSB 
or  human  RPA  (3,  6,  or  12  pmol),  if  present,  were  added  to  the 
reactions  as  indicated.  The  reactions  were  stopped  by  extract¬ 
ing  twice  with  phenol/chloroform/isoamyl  alcohol  (25:24:1). 
The  deaminated  products  were  analyzed  using  a  primer  elon- 
gation-dideoxynucleotide  termination  assay  (10).  Thermo 
Sequenase  (U.S.  Biochemical  Corp.)  was  used  to  extend  an 
18-mer  ^^P-labeled  primer  annealed  to  the  target  strand  in  the 
presence  of  three  dNTPs  and  either  ddATP  or  ddGTP  (80  ijlm 
each).  The  reactions  were  carried  out  for  seven  cycles  (95  °C  for 
30  s,  55  °C  for  45  s,  72  °C  for  1  min)  and  terminated  by  adding  an 
equal  volume  of  stop  solution  containing  95%  formamide  and 
20  mM  EDTA.  The  reaction  products  were  resolved  by  19% 
polyacrylamide  denaturing  gel  electrophoresis  and  analyzed  by 
phosphorimaging.  Deamination  efficiencies  were  calculated 
from  extension  reactions  with  the  ddA  mix  as  a  ratio  of  the  band 
intensity  opposite  the  converted  U  template  site  compared  with 
integrated  band  intensities  opposite  and  beyond  the  C  template 
site.  The  efficiencies  were  also  calculated  from  extension  reac¬ 
tions  with  ddG  as  a  ratio  of  integrated  band  intensities  beyond 
the  C  template  site  to  the  integrated  band  intensities  opposite 
and  beyond  the  C  template  site. 

Mutation  Analysis  of  AID-targeted  C  Deamination  in  Vitro — 
Deamination  specificities  of  WT  and  mutant  AID  were  meas¬ 
ured  using  the  following  reaction  conditions:  30-jULl  volume,  50 
mM  HEPES  (pH  7.5),  dithiothreitol  (1  mM),  MgCl2  (10  mM), 
gapped  DNA  (500  ng),  RNase  A  (200  ng),  and  WT  or  mutant 
AID  (50  to  100  ng  of  .^-expressed  AID  or  1  jutg  of  E.  colTex- 
pressed  AID).  Eollowing  incubations  for  2.5,  5,  and  10  min  at 
37  °C,  the  reactions  were  quenched  by  a  double  extraction  with 
phenol/chloroform/isoamyl  alcohol  (25:24:1).  Conversions  of 
C  — >  U  on  the  DNA  substrate  were  detected  as  white  or  light 
blue  plaques,  indicating  C  — >  T  mutations  in  a  lacZa  target  gene 
after  transfection  into  uracil  glycosylase-deficient  {ung~) 
E.  coli,  as  previously  described  (10). 

WT  and  mutant  AID  deamination  spectra  were  compared 
using  tests  with  the  null  hypothesis  that  the  distribution  of 


deaminations  at  the  nucleotide  positions  is  the  same  for  two 
enzymes.  We  consider  a  contingency  table  where  the  two  col¬ 
umns  are  the  WT  and  mutant  AID,  the  rows  are  the  nucleotide 
sites,  and  the  numbers  in  the  cells  are  the  deaminations  corre¬ 
sponding  to  that  column  and  row.  In  order  to  deal  with  the 
problem  of  small  denominators,  we  only  consider  those  sites  for 
which  the  sum  of  the  deaminations  for  the  two  enzymes  is  5  or 
greater  (we  have  tried  other  thresholds  and  reached  the  same 
conclusions). 

In  order  to  determine  whether  a  motif  is  deaminated  differ¬ 
ently  by  the  WT  or  mutant  enzymes,  we  performed  the  follow¬ 
ing  test.  Eor  each  WT  or  mutant  enzyme  (S38A,  S41A,  S43A,  or 
S43P),  we  calculated  the  “site  mutability  index”  by  dividing  the 
number  of  deaminations  at  each  site  by  the  total  number  of 
deaminations.  Eor  the  set  of  sites  sharing  the  motif,  we  com¬ 
pared  the  WT  index  with  each  of  the  four  mutant  AID  indexes 
by  the  paired  Wilcox  test.  We  used  all  four  mutants,  since  their 
spectra  are  similar  to  each  other  (but  not  the  WT),  and  there  are 
a  relatively  small  number  of  sites  for  each  motif.  Since  we 
repeated  this  test  for  each  of  the  16  motifs,  the  p  values  in  Table 
2  have  been  corrected  for  multiple  tests.  The  reported  value  is 
the  probability  that  if  16  independent  tests  have  been  per¬ 
formed,  then  at  least  one  will  be  as  extreme  as  observed:  c  =  1  — 
(1  —  where  c  is  the  correction  of p  from  the  Wilcox  test. 

Processivity  Deamination  Analysis  of  WT  and  Mutant  AID— 
An  85-nt  ssDNA  substrate  with  two  identical  hot  spot  AGC 
motifs,  containing  a  fluorescein  tag  located  between  the  two 
motifs,  was  used.  Deamination  reactions  (45-jULl  volume)  were 
carried  out  in  the  presence  of  15  pmol  of  ssDNA  substrate,  5-15 
pmol  of  WT  or  mutant  GST-AID,  and  200  ng  of  RNase  A  for 
2.5,  5,  and  10  min.  After  treatment  with  4  units  of  uracil  DNA 
glycosylase  (New  England  Biolabs)  and  alkali,  the  deamination 
products  were  separated  on  a  16%  denaturing  polyacrylamide 
gel,  visualized,  and  quantified  using  a  EX  fluorescence  scanner 
(Bio-Rad).  Analysis  of  processive  deamination  by  a  single  WT 
and  mutant  AID  enzyme  was  carried  out  as  described  (36,  37). 
The  single  deamination  rates  are  approximately  equal  at  the 
5'-C  and  3'-C  target  motifs.  The  correlated  double  deamina¬ 
tion  efficiency  is  calculated  as  the  probability  that  a  single  AID 
molecule  deaminates  both  5'  and  3'  target  motifs  on  one 
ssDNA  substrate  during  a  single  enzyme-DNA  encounter. 

RESULTS 

Human  AID  Expressed  in  Sf9  Cells  Is  Partially  Phosphoryl- 
ated  at  Ser  Residues  38,  41,  and  43  and  at  Thr^^ — Mass  spectral 
analysis  demonstrates  that  WT  recombinant  AID  expressed  in 
SJ9  insect  cells  is  phosphorylated  at  four  predicted  protein 
kinase  motifs;  two  motifs  are  the  previously  identified  Ser^^  and 
Thr^^  residues  (22,  24),  and  two  newly  identified  residues  are  at 
Ser^^  and  Ser^^  (Eig.  1).  Individual  peptide  fragments  of  AID 
showed  that  just  one  of  the  four  residues  was  phosphorylated. 
Multiple  phosphorylations  of  individual  proteins  were  not 
detected  despite  using  an  isolation  technique  that  strongly 
favored  the  detection  of  multiply  phosphorylated  peptide  frag¬ 
ments  (Eig.  1).  These  three  Ser  residues  38,  41,  and  43  along 
with  Thr^^  are  predicted  to  be  phosphorylation  sites  for  protein 
kinase  PKA  or  CAMK2,  protein  kinase  C8,  CKl,  and  PKA, 
respectively  (Web  Scansite  2.0,  medium  stringency  scan  (38)). 
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FIGURE  1 .  Human  AID  expressed  in  Sf9  cells  is  phosphorylated  at  four  sites.  Individual  phosphorylation  at 
threonine  27  and  serines  38,  41,  and  43  was  identified  by  mass  spectrometry  of  tryptic  phosphopeptides 
purified  by  IMAC.  MS/MS  spectra  of  four  distinct  doubly  charged  phosphopeptides  of  AID  are  shown.  In  each 
case,  the  phosphorylated  residue  identified  is  indicated  in  the  sequence  (pS  or  pT)  and  is  followed  by  a  number 
referring  to  the  amino  acid  position  in  the  protein.  In  the  spectrum,  the  peaks  corresponding  to  the  detected 
b-ions  (N-terminal  fragment)  and  y-ions  (C-terminal  fragments)  are  labeled  accordingly.  Also,  peaks  corre¬ 
sponding  to  the  neutral  loss  of  water  or  phosphate  are  labeled.  Only  monophosphorylated  peptides  were 
identified. 
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FIGURE  2.  Specific  activity  of  WT  and  mutant  AID  proteins  expressed  in 
Sf9  insect  cells  or  in  E,  colL  Deamination-specific  activities  were  measured 
using  36  nt  of  ^^P-labeled  ssDNA  containing  a  single  C  target  in  an  AGC  hot 
spot  motif.  Reactions  were  carried  out  for  5  min  at  37  °C  in  the  presence  of 
ssDNA  (500  fmol)  and  recombinant  GST-AID  (200  fmol)  expressed  in  Sf9  cells 
or  with  of  GST-AID  (6400  fmol)  expressed  in  E  coli.  Due  to  its  extremely  low 
specific  activity,  the  E  co//-expressed  AID  was  present  at  ~32-fold  higher  con¬ 
centration  than  Sf9-expressed  AID.  The  error  bars  represent  ±1  S.E.  from  at 
least  three  independent  measurements. 

Based  on  the  amount  of  protein  used  in  the  analysis,  we  esti¬ 
mate  that  the  phosphorylated  species  represents  less  than  a  few 
percent  of  the  total  AID.  The  detection  of  phosphorylated  AID 
required  that  the  enzyme  isolation  and  purification  be  carried 
out  in  the  presence  of  a  mixture  of  phosphatase  inhibitors  (see 
“Experimental  Procedures”)-  When  phosphatase  inhibitors  are 
omitted  from  the  lysis  and  purification  buffers,  we  were  unable 


to  detect  any  AID  phosphorylation 
by  mass  spectral  analysis  (data  not 
shown).  Phosphorylation  was  also 
not  detected  for  human  AID 
expressed  in  E.  coli  (data  not 
shown). 

Deamination  of  ssDNA  by  Phos¬ 
phorylation-defective  AID  Mu¬ 
tants — To  address  whether  phos¬ 
phorylation  might  regulate  AID 
activity,  we  constructed  single 
mutants  at  residues  38,  41,  or  43  by 
replacing  Ser  with  Ala  {Le.  “phos¬ 
phorylation  null”  mutants)  or  by 
replacing  Ser  with  Asp,  which  con¬ 
fers  a  —  1  charge.  Deamination  mea¬ 
surements  were  made  in  the  linear 
range  of  reaction  times  and  protein 
concentrations,  using  the  same 
protein  purification  protocol  (see 
“Experimental  Procedures”). 

Each  AID  mutant  expressed  in 
insect  cells  is  active  on  ssDNA, 
behaving  similarly  to  WT  AID 
within  a  factor  of  3  (Eig.  2).  S43A  has 
about  the  same  specific  activity  as 
WT  AID,  whereas  S38A  and  S41A 
are  roughly  2-fold  less  active.  The 
S38D  and  S43D  mutants  are  similar 
to  WT  AID,  whereas  S41D  and  S43P 
are  about  2.5-fold  less  active.  In  contrast,  human  AID  expressed 
in  E.  coli  is  ~  100-fold  less  active  than  .^-expressed  AID  (Eig. 
2).  The  much  lower  specific  activity  of  £.  co//-expressed  AID  is 
not  likely  to  be  caused  by  the  absence  of  phosphorylation.  Nota¬ 
bly,  .^-expressed  AID  purified  in  the  absence  of  phosphatase 
inhibitors  showed  no  detectable  amount  of  phosphorylation 
but  behaved  indistinguishably  from  phosphorylated  AID  with 
regard  to  specific  activity  and  in  each  biochemical  measure¬ 
ment  performed  below  (data  not  shown).  We  conclude  that  the 
ability  of  AID  to  deaminate  ssDNA  does  not  depend  on  phos¬ 
phorylation;  nor  does  the  activity  appear  to  require  charged 
amino  acid  residues  at  positions  38,  41,  and  43. 

Substantial  Changes  in  WT  AID  Deamination-induced 
Mutation  Spectra  Are  Caused  by  Amino  Acid  Substitutions  at 
Serines  38,  41,  and  43 — AID  was  incubated  in  vitro  with  phage 
M13mp2,  a  circular  DNA  substrate  containing  a  lacZa  reporter 
target  sequence  in  a  365-nt  single-stranded  gapped  region  (Eig. 
3a).  Deaminations  are  detected  as  C  — >  T  mutations  occurring 
in  DNA  isolated  from  individual  phage  progeny  following 
transfection  of  AID-treated  DNA  into  ung~  E.  coli  (10).  The 
ratio  of  mutant  phage  (clear  or  light  blue  plaques)  to  wild  type 
phage  (dark  blue  plaques)  is  <5-7%,  which  ensures  that  each 
mutant  phage  reflects  the  action  of  a  single  AID  molecule  acting 
on  M13  DNA  (Table  1).  Conditions  that  limit  M13  DNA  tar¬ 
geting  to  a  single  AID  molecule  are  satisfied  at  short  (2.5-min) 
and  long  (10-min)  incubations  with  AID.  We  have  verified  that 
C  — >  T  mutations  are  absent  in  DNA  isolated  from  wild  type 
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FIGURE  3.  Deamination  spectra  of  WT  AID,  S38A,  and  S43P  phosphorylation-null  mutant  AID.  a,  sketch  of 
a  protocol  to  detect  C  ^T  mutations  resulting  from  AID-catalyzed  C  deaminations  occurring  on  individual 
ssDNA  substrate  molecules  in  a  gapped  Ml  3  DNA  construct. The  ssDNA  gap  region  is  365  nucleotides  long  and 
contains  230  nucleotides  of  a  lacZa  reporter  sequence.  Mutations  in  lacZa  are  identified  as  clear  or  light  blue 
plaques;  nonmutated  phage  appear  as  dark  blue  plaques  (see  "Experimental  Procedures").  b,C^\J  deamina¬ 
tion  spectra  and  the  identification  of  C  ^T  mutations  in  DNA  clones  after  incubation  with  WT,  S38A,  and  S43P 
AID  for  2.5  and  10  min  were  obtained  by  sequencing  about  50  individual  mutant  clones.  Each  colored  bar 
represents  a  percentage  of  mutated  phage  clones  with  a  C  ^T  mutation  at  the  indicated  position  on  the  lacZa 
target  sequence  (-215  to  +149).  Red  bars  identify  C  deaminations  occurring  in  5'-WRC  hot  spot  motifs,  blue 
bars  represent  5'-SYC  cold  spot  motifs,  and  green  bars  represent  intermediate  motifs  (neither  WRC  nor  SYC). 
WT  -  S38A  and  WT  -  S43P  "difference  spectra"  at  2.5  min  and  10  min  represent  side-by-side  comparison  of 
deamination  spectra  of  WT  and  S38A  and  WT  and  S43P,  respectively.  The  bars  in  the  difference  spectra  repre¬ 
sent  the  differences  in  deamination  frequencies  at  each  deaminated  site  by  in  the  mutant  AID  spectrum 
compared  with  the  ones  of  WT  (deamination  frequencies  in  the  mutant  AID  spectrum  minus  deamination 
frequencies  of  WT).  Bars  below  0  represent  excess  deaminations  for  WT  AID,  and  bars  above  0  denote  excess 
mutant  AID  deaminations. 


phage  by  direct  sequencing  of  individual  DNA  clones  isolated 
from  dark  blue  plaques  (10). 

There  are  pronounced  differences  in  C  — >  T  mutational  spec¬ 
tra  engendered  by  differences  in  the  deamination  patterns  com¬ 
paring  WT  AID  with  S38A  and  S43P  AID  mutants,  whereas  the 
mutants  closely  resemble  each  other  (Fig.  3b).  These  differ¬ 
ences  occur  although  the  WT  and  mutant  enzymes  clearly  favor 
deaminations  in  WRC  hot  spot  motifs  and  disfavor  deamina¬ 
tions  in  SYC  cold  spot  motifs  (Table  2).  For  both  WT  and 


mutant  AID,  the  shorter  incubation 
period  (2.5  min)  shows  a  combina¬ 
tion  of  strongly  favored  deamina¬ 
tions  in  WRC  hot  spot  motifs  (Fig. 
3b  {red  bars),  Table  2,  and  supple¬ 
mental  Fig.  1)  relative  to  SYC  cold 
spot  and  intermediate  deamination 
motifs  (Fig.  3b,  blue  bars  and  green 
bars,  respectively).  A  comparison  of 
WT  and  mutant  AID  spatial  deami¬ 
nation  patterns  can  be  made  by 
dividing  the  365-nt  ssDNA  gapped 
region  into  four  regions:  R^^  (nucle¬ 
otide  positions  —215  to  —159),  R2 
(-158  to  -45),  R3  (-44  to  +79), 
and  R4  (+80  to  +147).  There  is  a 
large  disparity  in  the  clonal  deami¬ 
nation  patterns  in  R^,  R2,  or  R4, 
comparing  mutant  S38A  and  S43P 
with  WT  AID  (Fig.  3b). 

To  evaluate  the  spatial  differ¬ 
ences  in  deamination  quantita¬ 
tively,  we  have  computed  a  differ¬ 
ence  spectrum  (Fig.  3b,  A)  by 
subtracting  the  frequency  of  deami¬ 
nated  clones  in  the  WT  AID  spectra 
from  those  of  S38A  and  S43P  AID  at 
every  target  C  site  in  the  365-nt  gap 
(Fig.  3b).  Lines  extending  below  0 
represent  excess  deaminations  for 
WT  AID,  and  lines  above  0  denote 
excess  S38A  and  S43P  mutant  AID 
deaminations  (Fig.  3b).  The  differ¬ 
ence  spectra  reveal  a  much  higher 
probability  of  WT  AID-catalyzed 
deaminations,  especially  in  WRC 
motifs,  occurring  in  R^,  somewhat 
less  so  in  R3,  and  much  more  so  in 
R4.  In  contrast,  mutant  AID-cata¬ 
lyzed  deaminations  are  clearly 
favored  in  R2  (Fig.  3b).  A  test  (see 
“Experimental  Procedures”)  pro¬ 
vides  strong  statistical  support  that 
the  WT  and  mutant  spectra  are  sig¬ 
nificantly  different:  at  2.5  min,  S38A 
{p  <  2.2  X  10“^^),  S43P  ip  <  2.6  X 
10“^),  S41A  ip  <  2.2  X  10“^^), 
S43A  (p  <  2.2  X  10“^^);  at  10  min, 
S38A  (p<2.2X  10“^^),  S43P  (p  < 
7.4  X  10“^),  S41A  {p  <  4.6  X  10“^),  S43A  {p  <  2.2  X  10“^^). 

For  WT  AID,  there  are  numerous  mutations  in  WRC  motifs 
(Fig.  3b,  red  bars)  in  the  R^  region,  with  about  50%  of  the  clones 
having  mutations  at  C  template  position  —184  and  '~18-22% 
of  the  clones  with  mutations  at  positions  —209,  —166,  and 
— 160  (Fig.  3b).  Notably,  S38A  and  S43P  AID  poorly  deaminate 
WRC  motifs  at  sites  —209  and  —184,  which  correspond  to  the 
two  favored  hot  spot  sites  in  R^  in  the  wild  type  AID  spectrum. 
A  second  difference  is  that  in  the  beginning  of  R2,  a  cluster  of 
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FIGURE  3 — continued 


TABLE  1 

Time  course  analysis  of  deamination  by  wild-type  and  mutant  AID 
on  ssDNA  gap  substrate 


Average  number 
of  mutations 
per  clone 

Mutant  frequency 

(xlO-0" 

2.5  min 

10  min 

2.5  min 

5  min 

10  min 

.^-expressed 

WT  AID 

13 

20 

3.8 

5.1 

5.7 

S38A 

9 

16 

3.7 

4.7 

6.1 

S41A 

11 

15 

2.7 

3.4 

4.3 

S43A 

16 

24 

4.6 

4.7 

5.5 

S43P 

10 

20 

1.7 

2.2 

2.5 

E.  coli-expressed  WT  AID 

ND^ 

17 

0.9 

1.4 

1.5 

The  background  mutant  frequency  for  gapped  substrate  in  the  absence  of  AID 
is  —0.07  X  10“^.  E.  coli-expressed  AID  was  present  at  10  -20-fold  higher  levels 
(1  /xg)  compared  with  .^-expressed  AIDs  (50-100  ng). 

^  Not  determined. 

template  Cs  with  SYC  cold  spots  (Fig.  3b,  blue  bars)  and  inter¬ 
mediate  motifs  (Fig.  3b,  green  bars)  are  deaminated  at  signifi¬ 
cant  frequencies  (10  -30%).  Deaminations  at  these  sites  are  vir¬ 
tually  absent  in  the  WT  AID  spectrum  (Fig.  3b),  A  major 
difference  between  WT  and  S38A  and  S43P  AID  occurs  in  R4, 
where  deaminations  in  WRC  motifs  (Fig.  3b,  red  bars)  are  pres¬ 
ent  in  a  much  higher  fraction  in  the  WT  spectra  compared  with 


S38A  and  S43P  (Fig.  3b),  Differ¬ 
ences  in  WT  versus  S38A  and  S43P 
AID  spectra  also  occur  in  R3  but  are 
far  less  pronounced. 

A  mutability  index  (MI),  which 
denotes  the  experimentally  meas¬ 
ured  mutation  frequency  for  a  par¬ 
ticular  sequence  motif  compared 
with  the  expected  unbiased  muta¬ 
tion  frequency  (39),  has  been  used 
as  a  quantitative  measure  for  AID 
deamination  specificity  (10).  MI  val¬ 
ues  for  the  12  trinucleotide  motifs 
for  WT,  S38A,  and  S43P  AID  have 
been  tabulated  (Table  2).  On  aver¬ 
age,  WT  and  mutant  AID  exhibit 
the  same  general  deamination  pref¬ 
erences,  strongly  favoring  hot  spots 
(WRC)  over  cold  spots  (SYC),  with 
intermediate  motifs  in  between 
(Table  2,  MI  average). 

The  large  disparity  in  the  WT  and 
mutant  AID  spectra  may  be  caused 
by  differences  in  the  distribution 
of  individual  intermediate  triplet 
motifs  in  the  lacZa  regions  and 
their  mutability  by  WT  and  mutant 
AID.  For  example,  in  the  R2  region, 
which  strongly  favors  S38A  and 
S43P  over  WT  AID  deaminations, 
the  intermediate  motifs  CGC  and 
GGC  are  each  represented  five  and 
four  times,  respectively.  These  two 
intermediate  motifs  have  higher 
mutability  indexes  for  S38A  and 
S43P  compared  with  WT  AID  by 
factors  of  —2.7  {p  =  0.004)  and  3.4  {p  =  0.0001),  respectively 
(Table  2).  In  contrast,  ACC,  which  is  the  intermediate  motif 
most  strongly  favored  by  WT  over  mutant  AID  by  a  factor  of  3.7 
{p  =  0.02)  (Table  2),  is  absent  from  R2. 

The  spectral  differences  for  WT  and  mutant  AID  persist  for 
the  longer  (10-min)  incubation  period  (Fig.  3b),  The  frequency 
of  deaminated  clones  at  each  site  increases  in  an  approxi¬ 
mately  spatially  uniform  manner  in  all  regions  from  2.5  to  10 
min  for  WT  and  mutant  AID.  The  difference  spectra  com¬ 
paring  WT  with  mutant  AID  for  10-min  incubations  (Fig.  3b) 
show  that  WT  AID  is  far  more  active  in  R^,  R3,  and  R4,  and 
S38A,  S41  A,  S43A,  and  S43P  AID  mutants  are  more  active  in 
R2,  in  accord  with  the  2.5  min  incubation  data  (Fig.  3b  and 
supplemental  Fig.  1). 

A  compilation  of  individual  trinucleotide  motifs  present  in  Ig 
switch  regions  reveals  that  two  of  the  most  highly  represented 
motifs  are  much  more  strongly  favored  for  deamination  by  the 
phosphorylation-null  mutants  compared  with  WT  AID.  The 
GGC  motif  (MI  1.0  S43P,  MI  0.23  WT;  Table  2)  represents  33% 
of  IgA,  19%  of  IgE,  and  15%  of  IgG4  motifs  (Table  3).  The  AGC 
motif  (MI  2.0  S43P,  MI  1.3  WT;  Table  2)  represents  40%  of  IgA, 
14%  of  IgE,  and  11%  of  IgG4  (Table  3).  Notably,  S43P  AID  is 
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TABLE  2 


Three-nucleotide  motif  Ml  for  WT  and  mutant  AID  (S38A  and  S43P) 
at  2.5  min 

The  MI  is  defined  as  the  number  of  times  a  given  trinucleotide  motif  within  a 
segment  of  DNA  contains  a  mutation,  divided  by  the  number  of  times  the  oligonu¬ 
cleotide  would  be  expected  to  be  mutated  for  a  mechanism  with  no  sequence  bias. 

Motif 

MI 

WT 

S38A 

S43P 

p  value"* 

Hot  spots 

AAC 

2.5 

1.5 

1.6 

0.41 

AGC 

1.3 

2.1 

2.0 

0.06 

TAG 

3.4 

2.0 

2.2 

0.22 

TGC 

2.6 

2.2 

2.6 

0.99 

MI  average  (WRC) 

2.5 

1.9 

2.1 

Cold  spots 

CCC 

0.28 

0.52 

0.44 

0.65 

CTC 

0.45 

0.76 

0.53 

0.99 

GCC 

0.05 

0.18 

0.05 

0.07 

GTC 

0.32 

0.67 

0.00 

0.99 

MI  average  (SYC) 

0.28 

0.53 

0.26 

Intermediates 

ACC 

1.2 

0.32 

0.35 

0.02 

ATC 

1.0 

0.63 

0.71 

0.97 

CAC 

1.4 

0.91 

0.96 

0.31 

CGC 

0.5 

1.3 

1.4 

0.004 

GAC 

0.24 

0.5 

0.35 

0.79 

GGC 

0.23 

0.78 

1.0 

0.0001 

TCC 

0.48 

1.1 

1.1 

0.75 

TTC 

0.53 

0.71 

0.71 

0.46 

MI  average 

0.70 

0.78 

0.82 

values  were  calculated 

by  the  paired  Wilcox  test  (see  “Experimental 

Procedures”). 

TABLE  3 

Distribution  of  trinucleotide  motifs  in  Ig  switch  regions 

The  number  of  trinucleotide  motifs  was  calculated  for  the  nontranscribed  strand  of 

IgA  (mouse  germ  line  IgA  switch  region, 

accession 

number  gi:194415),  IgE  (human 

IgE  switch  region,  accession 

number  gi:32983),  and  IgG  (human  Ig  y-4  switch 

region,  accession  number  gi:31510). 

Number  of  motifs 

IgA 

IgE 

IgG 

Hot  spots 

AAC 

6 

37 

24 

AGC 

79 

90 

148 

TAC 

4 

10 

7 

TGC 

7 

30 

83 

Cold  spots 

CCC 

0 

49 

122 

CTC 

1 

31 

106 

GCC 

2 

54 

135 

GTC 

5 

17 

44 

Intermediates 

ACC 

4 

42 

74 

ATC 

2 

20 

21 

CAC 

2 

34 

97 

CGC 

3 

11 

30 

GAC 

15 

62 

79 

GGC 

65 

126 

196 

TCC 

2 

30 

108 

TTC 

0 

22 

57 

present  in  a  patient  with  hyper-IgM  syndrome  characterized  by 
the  absence  of  CSR  (32). 

Wild  Type  and  Mutant  AID  Act  Processively  on  ssDNA— The 
average  numbers  of  C  — >  T  mutations  were  between  9  and  16 
for  a  2.5'min  incubation  of  the  M13  gapped  substrate  with 
either  WT  AID  or  each  of  the  four  AID  mutants  and  were 
between  15  and  24  for  a  lO-min  incubation  (Table  1).  At  the 
lO-min  time  point,  roughly  two-thirds  of  the  clones  acted  on  by 
WT  AID  had  more  than  10  mutations,  with  some  clones  con¬ 
taining  60-70  mutations  (supplemental  Fig.  2).  A  similar  frac¬ 


tion  of  clones  containing  >  10  mutations  is  observed  for  S38 A 
AID  (supplemental  Fig.  2),  although  compared  with  WT  AID, 
this  mutant  produces  many  more  clones,  about  50%,  with 
10-20  mutations  and  a  smaller  number  of  clones  having  >20 
mutations.  Since  more  than  90%  of  the  DNA  exposed  to  AID 
has  no  mutations,  as  determined  by  sequencing  DNA  isolated 
from  individual  blue  plaques  (see  Ref.  10),  both  WT  and  mutant 
AID  act  processively. 

An  ssDNA  oligonucleotide,  containing  a  fluorescein  tag  {F) 
located  between  the  two  identical  AGC  hot  spot  motifs,  has 
been  used  to  detect  multiple  deaminations  occurring  on  the 
same  ssDNA  molecule  (36, 37)  {Le.  a  single  conversion  of  C  — >  U 
in  either  the  3 '-motif  or  5 '-motif  or  double  C  — >  U  conversions 
occurring  in  both  3'  -  and  5 '-motifs)  (Fig.  4a),  AID  acts  bidirec¬ 
tionally,  resulting  in  roughly  equal  deamination  frequencies  at 
the  3'  and  5'  targets  on  the  same  DNA  (Fig.  4a), 

The  “processivity  factor”  is  calculated  as  the  ratio  of  the 
observed  fraction  of  double  deaminations  occurring  at  both 
5'-C  and  3'-C  on  the  same  ssDNA  substrate  to  the  predicted 
fraction  of  independent  double  deaminations  (36).  Processivity 
factors  between  3.5  and  9  for  WT  AID  and  AID  mutants  (Fig.  4, 
a-c)  confirm  that  processive  double  deamination  is  well  in 
excess  of  double  deamination  attributable  to  random  encoun¬ 
ters  with  the  same  ssDNA  substrate  by  two  AID  molecules.  The 
reduction  in  processivity  factors  with  time  reflects  an  increased 
probability  of  multiple  enzyme-DNA  encounters  when  a  larger 
percentage  of  substrates  are  deaminated  (Fig.  4,  b  and  c,  5-  and 
10-min  incubations).  The  average  correlated  double  deamina¬ 
tion  efficiency  for  WT  and  mutant  AID  is  —30%  (Fig.  4d)  {Le, 
70%  of  deaminations  catalyzed  by  a  single  AID  molecule  occur 
just  once,  at  either  the  3'  or  5'  target  motif,  and  30%  occur 
processively  at  both  3'  and  5'  target  motifs).  The  data  demon¬ 
strate  that  the  WT  and  mutant  forms  of  AID  are  processive  to 
about  the  same  extent.  Significantly,  the  processive  action  of 
AID  seems  unaffected  by  amino  acid  substitutions  that  elimi¬ 
nate  the  possibility  of  phosphorylation  at  sites  38,  41,  and  43. 

AID-catalyzed  Transcription-dependent  C  Deamination 
Occurs  in  the  Absence  of  Phosphorylation  and  RPA — We  have 
examined  how  phosphorylation  of  AID  affects  its  ability  to 
deaminate  C  during  active  transcription  of  dsDNA  (Fig.  5).  T7 
RNA  polymerase  is  used  to  transcribe  linear  dsDNA  having  a 
single  AGC  hot  spot  motif  on  the  nontranscribed  strand.  The 
deamination  of  C  to  U  on  the  nontranscribed  strand  is  detected 
by  measuring  the  termination  of  primer  elongation  caused  by 
incorporation  of  ddAMP  opposite  U  and  ddGMP  opposite  C 
(see  “Experimental  Procedures”).  Two  primer  elongation  reac¬ 
tions  are  performed  simultaneously,  where  ddATP  is  substi¬ 
tuted  for  dATP  (Fig.  Sa^  ddA  lanes)  and  ddGTP  is  substituted 
for  dGTP  (Fig.  5<^,  ddG  lanes).  The  conversion  of  C  — >  U  is 
detected  by  the  appearance  of  a  lower  doublet  termination  band 
corresponding  to  the  incorporation  of  ddAMP,  just  below  the  C 
template  position,  in  the  ddA  lanes  (Fig.  5a,  arrow,  ddA  lanes  5, 
7,9, 11, 13,  IS)  and  by  the  presence  of  bands  migrating  past  the 
C  template  site  in  the  ddG  lanes  (Fig.  5a,  bracket  to  the  right  of 
the  gels,  ddG  lanes  6,  8, 10, 12, 14,  and  16),  The  upper  doublet 
band  in  the  ddA  lanes  occurs  by  incorporating  dGMP  opposite 
residual  C  moieties  not  acted  on  by  AID  (Fig.  5a),  AID-cata¬ 
lyzed  deamination  occurs  only  when  the  dsDNA  undergoes 
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FIGURE  4.  Analysis  of  processive  deamination  by  WT  and  mutant  AID  on  the  same  oligonucleotide  ssDNA 
molecule,  a,  deamination  by  WT  AID  and  S38A,  S41  A,  S43A,  and  S43P  phosphorylation-null  AID  mutants  on  an 
ssDNA  substrate  with  two  C  targets.  An  ssDNA  substrate  (85  nt)  with  two  identical  C  deamination  hot  spot 
motifs  (AGO,  containing  a  fluorescein  (F)  reporter  molecule  located  between  the  two  motifs  {sketch  at  the  top), 
is  used  to  measure  single  and  double  deaminations  occurring  on  an  individual  ssDNA  substrate.  Single  deami¬ 
nations  of  the  5'-C  and  3'-C  are  detected  as  fluorescein-labeled  67-  and  48-nt  fragments,  5'C  and  3'C,  respec¬ 
tively;  double  deaminations  of  both  Cs  on  the  same  molecule  are  detected  as  a  30-nt  fragment  (5'C&  3'0.  The 
integrated  gel  band  intensities  are  used  to  determine  the  fraction  of  correlated  double  deaminations,  gener¬ 
ated  by  the  processive  action  of  a  single  AID  molecule  (see  "Experimental  Procedures"),  b  and  c,  deamination 
(percentage)  of  substrates  containing  single  deaminations  at  either  5'-C  ( ♦ )  or  3'-C  (■)  and  double  deamina¬ 
tions  at  both  5'-C  and  3'-C  (A)  obtained  from  the  integrated  gel  band  intensities  in  a  for  WT  AID  and  for  S38A 
mutant  AID.  #,  the  calculated  fraction  of  independent  double  deaminations  {Independent),  caused  by  two 
molecules  of  AID  attacking  each  site  independently.  The  processivity  factor,  defined  as  the  ratio  of  the 
observed  double  deaminations  occurring  at  both  5'-C  and  3'-C  on  an  individual  ssDNA  to  the  calculated 
fraction  of  independent  double  deaminations  (36),  is  shown  at  the  bottom  of  each  graph.  The  processivity 
factor  for  both  WT  and  mutant  AID  is  significantly  greater  than  3,  indicating  that  the  majority  of  double 
deaminations  are  caused  by  a  single  AID  molecule  deaminating  both  target  C  residues,  d,  calculated  average 
correlated  deamination  efficiency  for  WT  AID  and  AID  mutants.  The  "correlated  deamination"  efficiency  is 
the  probability  for  a  single  enzyme  to  perform  double  correlated  deaminations  at  both  3'-C  and  5'-C  upon  a 
single  encounter  with  an  ssDNA  substrate.  Error  bars,  ±  1  S.E. 


active  transcription  in  the  presence  of  T7  RNA  polymerase 
(Fig.  5a,  lanes  5  -  16),  whereas  no  deamination  is  detected  in 
the  absence  of  either  AID  or  T7  RNA  polymerase  (Fig.  5a, 
lanes  1-4). 

The  activity  of  human  WT  AID  expressed  in  Sfi  cells  is  either 
similar  or  slightly  less  when  compared  with  Ser  — >  Ala  phos¬ 
phorylation-null  mutants  or  with  Ser  — >  Asp  (  —  1  negatively 
charged)  mutants  (Fig.  5b).  The  activity  of  the  HIGM-2  S43P 
mutant  is  indistinguishable  from  WT  AID  (Fig.  5b).  WT  AID 


expressed  in  E.  coli  requires  about  a 
7-fold  higher  protein  concentration 
to  attain  a  similar  deamination  effi¬ 
ciency  on  transcribed  dsDNA  as 
.^-expressed  human  AID  (Fig.  5,  a 
{lanes  IS  and  16)  and  b).  Since  the 
E.  co//-expressed  AID  is  ~  100-fold 
less  active  than  ,y9-expressed  AID 
on  ssDNA  (Fig.  2),  a  smaller  differ¬ 
ence  in  the  relative  activities  of  Sfi 
and  E.  coli  AID  on  transcribed 
dsDNA  might  reflect  the  possibility 
that  the  measured  deamination 
activity  in  this  assay  may  be  depend¬ 
ent  on  a  rate-limiting  formation  of 
the  transcription  bubble  needed  to 
provide  ssDNA  {i.e.  the  nontran- 
scribed  strand)  as  a  substrate  for 
AID. 

AID-catalyzed  transcription- 
dependent  deamination  on  linear 
dsDNA  occurs  in  the  absence  of 
RPA  (Fig.  5).  Deamination  is  inhib¬ 
ited  when  RPA  is  present  at  substoi- 
chiometric  and  higher  levels  (3-12 
pmol)  compared  with  AID  (7  pmol) 
(Fig.  6a).  We  observed  —35-70% 
inhibition  of  transcription-depend¬ 
ent  deamination,  depending  on  the 
level  of  RPA  in  the  reaction,  which  is 
shown  by  a  reduction  in  the  inten¬ 
sity  of  ddA  termination  bands  (Fig. 
6a,  U  arrows)  and  by  a  concomitant 
reduction  in  the  intensities  of  bands 
migrating  beyond  the  target  C  site 
in  the  presence  of  ddGTP  (Fig.  6a, 
ddG  lanes  2, 4,  6,  and  8).  RPA  inhib¬ 
its  the  activities  of  S38A  and  S43P 
AID  mutants  to  about  the  same 
extent  as  WT  AID  (Fig.  6b).  A  sim¬ 
ilar  inhibitory  effect  of  RPA  on 
WT  AID  deamination  is  observed 
using  supercoiled  circular  DNA, 
containing  a  lacZ  target  gene  con¬ 
taining  numerous  hot  spot,  cold 
spot,  and  intermediate  target 
motifs  (Table  4). 

We  investigated  the  effect  of 
human  RPA  and  E.  coli  SSB  on 
AID  activity  on  a  linear  DNA  oligonucleotide  (36-mer,  Fig. 
6c)  and  gapped  (M13  DNA,  Fig.  6d)  substrates.  Both  single- 
stranded  binding  proteins  exert  a  significant  inhibitory 
effect  on  AID  activity.  Increasing  levels  of  either  RPA  or  SSB 
strongly  inhibit  AID  activity  on  the  oligomer  substrate  (Fig.  6c). 
There  is  a  monotonic  reduction  in  C  — >  T  mutant  frequencies 
on  the  gapped  substrate,  decreasing  to  background  mutation 
levels  (mutant  frequency  —0.07-  0.1%)  at  about  40  pmol  of  RPA 
or  SSB,  which  corresponds  to  a  saturating  concentration  of  sin- 
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FIGURE  5.  Deamination  activity  of  WT  and  mutant  AID  proteins  on  a  transcribed  linear  dsDNA.  Top, 
dsDNA  template  with  an  incorporated  T7  promoter  for  transcription  by  T7  RNA  polymerase.  A  single  target  C 
residue  located  on  the  nontranscribed  strands  is  indicated  in  boldface  type  by  a  larger  font  size,  a,  polyacrylam¬ 
ide  gel  electrophoresis  data  showing  C  ^  U  deamination  on  the  nontranscribed  DNA  strand  for  WT  and 
mutant  AID  proteins  expressed  in  Sf9  cells  {lanes  5-14)  and  WT  protein,  expressed  in  E  coll  {Ec-WT,  lanes  15  and 
76).  A  primer  elongation  dideoxynucleotide  termination  assay  (1 0)  (see  "Experimental  Procedures")  was  used 
to  measure  the  conversion  of  C  ^  U.  Deamination  is  measured  by  the  appearance  of  an  intense  termination 
band  just  below  the  template  C  band  (indicated  with  an  arrow  on  the  left  of  each  gel)  when  ddATP  is  used  in 
place  of  dATP  in  the  extension  reactions  {ddA,  lanes  5, 7,  9,  7  7,  13,  and  75).  Concomitant  with  the  conversion  of 
C  to  U  band  in  the  ddA  lanes  is  the  presence  of  bands  migrating  past  the  C  template  site,  denoted  by  the  bracket 
tother/ghf  ofthegel,  when  ddGTP  is  used  instead  ofdGTP  {ddG  Ianes6,8,  W,  72,  and  74).  Deamination  on  the 
dsDNA  substrate  is  not  detected  in  the  absence  of  either  AID  orT7  RNA  polymerase  {lanes  7-4).  The  presence 
of  intact  C  template  is  observed  by  the  absence  of  the  lower  band  Just  belowthe  C  template  position  in  the  ddA 
lanes  {lanes  7  and  3)  and  by  the  absence  of  bands  migrating  beyond  the  C  template  site  in  the  ddG  lanes  {lanes 
2  and  4).  b,  deamination  efficiency  of  WT  and  mutant  AID  on  transcribed  dsDNA.  The  deamination  reactions 
were  carried  out  in  the  presence  of  7-fold  more  (50  pmol)  E  co//-expressed  WT  AID  {Ec-WJ)  compared  with 
Sf9-expressed  AID  (7  pmol). 


gle-stranded  binding  protein  (one  RPA  trimer  or  one  SSB  tet- 
ramer  to  SO-nt  ssDNA;  see  Refs.  40  and  41). 

PKA-phosphorylated  AID  Activity  on  Single-stranded  DNA 
and  on  Transcriptionally  Activated  Double-stranded  DNA— 
Since  phosphorylation  at  Ser^^  has  been  implicated  in  efficient 
CSR  in  vivo  (22-24,  27)  and  in  transcription-dependent  deami¬ 
nation  in  vitro  (22,  27),  we  have  incubated  AID  with  the  cata¬ 


lytic  subunit  of  human  PKA  in  the 
presence  of  y-^^P-labeled  dATP,  to 
determine  if  more  extensive  phos¬ 
phorylation  at  Ser^^,  which  has  a 
PKA  motif,  might  affect  the  bio¬ 
chemical  properties  of  AID.  We 
observed  a  strong  phosphorylation 
signal  corresponding  to  the  electro¬ 
phoretic  migration  of  AID  that  is 
dependent  on  incubation  with  PKA 
(Fig.  la,  right-hand  gel),  and  we  esti¬ 
mate  that  about  50%  of  the  AID  is 
phosphorylated  based  on  the  inten¬ 
sities  of  Coomassie-stained  bands 
corresponding  to  phosphorylated 
and  nonphosphorylated  AID  (Fig. 
la,  left-hand  gel).  Excess  phospho¬ 
rylation  has  no  significant  effect  on 
the  specific  activity  of  AID  acting  on 
ssDNA  (Fig.  lb).  The  same  conclu¬ 
sion  holds  for  transcription-de¬ 
pendent  deamination  of  dsDNA  in 
the  presence  or  absence  of  RPA 
(Fig.  7c). 

DISCUSSION 

AID  resides  principally  in  the 
cytoplasm  of  activated  B  cells  (42, 
43)  but  is  transported  into  the 
nucleus,  where  it  catalyzes  deami¬ 
nation  of  C  — >  U  on  Ig  V-gene 
regions  undergoing  active  tran¬ 
scription.  Approximately  10-15% 
of  AID  expressed  in  B  cells  is 
reported  to  undergo  phosphoryla¬ 
tion  at  Ser^^  (23).  We  have  found 
using  Sf9  insect  cells  to  overexpress 
human  GST- AID  that  a  small  frac¬ 
tion  of  the  enzyme  is  phosphoryla¬ 
ted  at  Ser^^  and  Thr^^  and  at  two 
residues  not  reported  previously, 
Ser^^  and  Ser^^  (Fig.  1).  The  data 
suggest  that  phosphorylation  oc¬ 
curs  on  just  one  of  these  four  posi¬ 
tions  in  a  single  AID  molecule. 
There  is  considerable  interest  in 
understanding  the  biological  roles 
of  phosphorylation  of  AID,  espe¬ 
cially  with  respect  to  the  ability  to 
target  transcriptionally  active  V-re- 
gions  and  switch  regions  selectively 
(22,  27).  The  replacement  of  Ser^^  with  Ala  delays  the  onset  of 
CSR  in  B-cells  (23,  44)  and  has  been  reported  to  abrogate 
transcription-dependent  deamination  on  linear  dsDNA  using  T7 
RNA  polymerase  in  vitro  (22,  27).  The  replacement  of  Ser^^  with 
Pro  engenders  hyper-IgM  syndrome  in  humans  (32). 

In  this  paper,  we  aimed  to  distinguish  between  how  phospho- 
rylation-null  mutants  versus  phosphorylation  per  se  affects  the 
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FIGURE  6.  Effect  of  single-stranded  binding  proteins  (human  RPA  and 
E,  coli  SSB)  on  AID  deamination  activity,  a,  deamination  of  WT  AID  on  the 

transcribed  linear  dsDNA  in  the  presence  of  increasing  amounts  of  RPA. 
Experiments  were  carried  out  as  described  in  the  legend  to  Fig.  5a.  b,  relative 
deamination  efficiencies  on  transcribed  dsDNA  of  WT  and  mutant  S38A  and 
S43P  AID  in  the  presence  of  RPA.  Deamination  activities  in  the  presence  of 
RPA  were  normalized  to  the  activity  of  the  enzymes  in  the  absence  of  RPA. 
c,  RPA  and  SSB  completely  inhibit  activity  of  WT  AID  on  a  fluorescein  (F)- 
labeled  36-nt  ssDNA  substrate  with  a  single  hot-spot  C  (AGQ  target.  Reac¬ 
tions  were  carried  out  in  the  presence  of  7.5  pmol  of  substrate  DNA  (255  pmol 
as  nucleotides)  and  3  pmol  of  WT  AID  for  either  5  or  1 0  min.  RPA  or  SSB,  when 
present,  were  added  at  saturating  concentration  of  10  or  20  pmol.  d,  WT 
AID-catalyzed  deamination  on  the  gapped  ssDNA  substrate  in  the  presence 
of  RPA  and  SSB.  Deamination  activity  is  monitored  by  the  increase  in  fre¬ 
quency  of  mutant  (white  or  light  blue  color)  Ml  3  phage  caused  by  C  to  U 
conversion  in  the  lacZa  target  reporter  gene.  The  background  mutant  fre- 
quencyfor  the  gapped  DNA  in  the  absence  of  AID  is  typically  0.7  X  10“^  (10). 
At  40  pmol,  RPA  and  SSB  saturate  ssDNA  gapped  substrates  (one  RPA  trimer 
or  one  SSB  tetramer  to  —30  nt  of  ssDNA).  Error  bars,  S.D.  values. 


TABLE  4 

Effect  of  RPA  on  AID  activity  on  transcribed  closed  circular  dsDNA 
substrate 


AID  deamination  activity  was  measured  using  a  covalently  closed  circular  dsDNA 
M13mp2T7  substrate,  undergoing  active  transcription  by  T7  RNA  polymerase  (see 
Ref.  11). 


Mutation  frequency  ( X 10  ^) 

No  AID 

0.8  ±  0.3 

WT  AID 

3.7  ±  0.5 

WT  AID  RPA  (60  ng) 

4.7  ±  0.9 

WT  AID  RPA  (300  ng) 

2.5  ±  0.5 

WT  AID  RPA  (1  pLg) 

2.2  ±  0.4 

biochemical  behavior  of  AID.  The  biochemical  properties 
investigated  include  enzyme  specific  activity,  processivity, 
deamination  spectra,  deamination  motif  specificity,  and  trans¬ 
cription-dependent  deamination,  including  the  influence  of 
RPA.  The  behavior  of  insect  cell-expressed  human  AID  acting 
on  ssDNA  and  transcribed  dsDNA  in  vitro  mimic  characteris¬ 
tics  of  SHM  in  B-cells,  including  an  approximate  simulation  of 
hot  spot  and  cold  spot  C  ^  T  mutational  patterns  (10).  Conse- 
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FIGURE  7.  Effect  of  in  vitro  phosphorylation  on  AID.  a,  efficient  phospho¬ 
rylation  of  human  AID  by  recombinant  catalytic  subunit  of  human  PKA  {hPKA). 
Fluman  PKA  and  AID  were  incubated  at  a  1 :2  or  1 :1  ratio  in  the  presence  of  1 00 
piM  ATP  and  P^P]ATP.  About  50%  of  AID  is  phosphorylated  (A/D-P*)  by  human 
PKA  (at  a  1:1  ratio)  as  shown  by  the  presence  of  a  slower  migrating  band  on  a 
Coomassie-stained  SDS-polyacrylamide  gel.  The  catalytic  subunit  of  hPKA 
alone  is  autophosphorylated  in  the  presence  of  ATP  (45).  b,  phosphorylation 
of  AID  does  not  alter  the  requirement  for  activation  by  RNase  (A/D-P*;  lanes 
6-9)  or  change  AID  activity  on  ssDNA.  c,  phosphorylation  of  AID  has  no  stim¬ 
ulating  effect  on  the  activity  of  AID  on  the  transcribed  dsDNA  even  in  the 
presence  of  RPA.  RPA  exhibits  similar  inhibitory  effect  on  both  low  phospho¬ 
rylated  AID  and  highly  phosphorylated  AID  (A/D-P*)  in  a  T7-driven  transcrip¬ 
tion  deamination  assay.  The  deamination  efficiencies  were  normalized  to  the 
activity  of  low  phosphorylated  AID  in  the  absence  of  RPA. 


quently,  a  comparative  analysis  of  WT  and  mutant  AID  could 
provide  biologically  relevant  insights  into  the  role  of  phospho¬ 
rylation  in  initiating  SHM  and  CSR. 

A  comparison  of  WT  AID  with  phosphorylation-null  AID 
mutants  shows  major  differences  in  deamination  spectra  (Fig. 
3)  and  trinucleotide  motif  specificity  (Table  2).  However,  there 
appear  to  be  no  significant  differences  in  any  of  the  other  bio¬ 
chemical  properties  of  AID.  The  specific  activities  of  WT  and 
mutant  .^-expressed  human  AID  are  similar  to  within  a  factor 
of  3  (Fig.  2).  Given  the  inherent  variability  in  enzyme  prepara- 
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tions,  these  differences  are  probably  not  important.  Much  more 
important,  however,  is  that  phosphorylation  of  AID  at  any  of 
the  three  serine  residues  is  not  essential  for  AID  activity.  With 
relevance  to  human  immunodeficiency  disease,  the  S43P  mu¬ 
tant  associated  with  HIGM-2  syndrome  shows  essentially  nor¬ 
mal  activity  on  ssDNA  (Fig.  2)  and  transcribed  dsDNA  (Figs.  5 
and  6)  but  generates  a  significantly  different  deamination-de¬ 
pendent  C  — >  T  mutation  spectrum  compared  with  WT  AID 
(Fig.  3b), 

A  hallmark  property  of  WT  AID  is  its  ability  to  deaminate 
ssDNA  in  vitro  with  high  processivity,  using  a  combination  of 
random  sliding  and  jumping  modes  of  action  (36,  37).  The 
phosphorylation-null  AID  mutants  are  as  processive  as  WT 
AID;  the  mutants  and  WT  AID  generate  about  the  same  aver¬ 
age  numbers  of  mutations/clone  (Table  1),  exhibit  similar  pro¬ 
cessivity  factors  {e.g.  see  Fig.  4b,  comparing  WT  AID  and  S38A 
mutant  AID),  and  catalyze  about  the  same  fraction  of  correlated 
double  deaminations  (Fig.  4d), 

When  T7  RNA  polymerase  was  used  to  transcribe  linear 
dsDNA  (Figs.  5  and  6)  or  supercoiled  circular  dsDNA  (Table  4), 
there  were  no  discernible  differences  in  the  deamination  activ¬ 
ity  of  WT  AID  compared  with  phosphorylation-null  AID 
mutants,  either  in  the  presence  or  absence  of  RPA  (Fig.  6). 
These  data  are  in  contrast  to  a  previous  report  showing  that 
mouse  AID  obtained  from  B-cells  requires  phosphorylation  at 
Ser^^  and  the  presence  of  RPA  to  deaminate  linear  dsDNA  dur¬ 
ing  transcription  by  T7  RNA  polymerase  (22,  27).  Although  we 
cannot  explain  the  apparent  discrepancy  between  the  two 
results,  our  data  with  phosphorylation-null  mutants  suggest 
that  neither  phosphorylation  per  se  nor  RPA  is  needed  for 
transcription-dependent  deamination  on  either  linear  or  circu¬ 
lar  dsDNA,  using  GST-tagged  human  AID  expressed  either  in 
insect  cells  (Figs.  5  and  6  and  Table  4)  or  in  E,  coli  (Fig.  5)  (see 
also  Ref.  9).  There  is  also  no  apparent  increase  in  transcription- 
dependent  deamination  for  AID  that  has  been  about  50%  phos- 
phorylated  by  the  catalytic  subunit  of  PKA,  irrespective  of 
whether  or  not  RPA  is  included  in  the  transcription  reaction 
(Fig.  7). 

The  presence  of  a  single-stranded  binding  protein  (human 
RPA  or  E,  coli  SSB)  at  a  saturating  concentration  (one  RPA 
trimer  or  one  SSB  tetramer  per  about  30  nt  of  ssDNA)  leads  to 
an  almost  complete  inhibition  of  AID  activity  on  both  short 
oligonucleotide  and  gapped  ssDNA  substrates  (Fig.  6,  c  and  d). 
We  suggest  that  by  binding  to  ssDNA,  RPA  and  SSB  shield 
templates  from  deamination  by  AID.  The  similar  inhibitory 
effects  of  RPA  and  SSB  suggest  that  selective  binding  of  RPA  to 
AID  is  not  occurring,  nor  is  phosphorylation  of  AID  necessary 
(Fig.  6),  at  least  for  the  case  of  human  AID  expressed  in  insect 
cells.  The  absence  of  detectable  deamination  after  a  relatively 
lengthy  (10  min)  incubation  time  suggests  that  AID  is  unable  to 
displace  RPA  or  SSB  from  ssDNA  (Fig.  6c).  Perhaps  RPA  might 
act  to  protect  B  cells  from  a  potential  “global”  mutator  activity 
of  AID  when  DNA  becomes  transiently  single-stranded  during 
normal  replication  and  repair  processes. 

In  contrast  to  the  absence  of  discernible  differences  in  spe¬ 
cific  activity,  processivity,  and  transcription-dependent  deami¬ 
nation,  the  replacement  of  Ser^^  with  Ala  or  Ser'^^  with  Pro  has 
a  significant  effect  on  deamination  motif  specificity  and  C  — >  T 


mutation  spectra  that  distinguish  the  phosphorylation-null 
mutants  from  WT  AID  (Fig.  3b).  The  spectral  changes  are  most 
pronounced  in  lacZa  region  R2  (Fig.  3b,  A  difference  spectra), 
where  the  AID  mutants  are  much  more  active  than  WT  AID, 
and  in  regions  R^^  and  R4,  where  WT  AID-catalyzed  mutations 
predominate  (Fig.  3b,  A  difference  spectra).  The  spectra  for  all 
of  the  phosphorylation-null  AID  mutants  at  amino  acid  resi¬ 
dues  38,  41,  and  43  are  similar  to  each  other  (Fig.  3b  and  sup¬ 
plemental  Fig.  1)  and  are  substantially  distinct  from  WT  AID  {p 
values  of  —10“^). 

WT  AID  and  phosphorylation-null  AID  mutants  share  the 
same  general  motif  preferences,  where  WRC  hot  spot  motifs 
are  generally  favored  over  intermediate  motifs  and  SYC  cold 
spot  motifs  (Table  2).  However,  the  average  MI  for  WRC  hot 
spot  motifs  is  higher  for  WT  AID  (average  MI  —2.5)  (Table  2) 
compared  with  S38A  and  S43P  AID  mutants  (average  MI  —2.0) 
(Table  2),  thus  providing  a  possible  explanation  for  excess  WT 
AID  deaminations  in  regions  R^^  and  R4.  An  analysis  of  individ¬ 
ual  DNA  clones  used  to  compile  the  mutational  spectra  shown 
in  Fig.  3b  reveals  that  mutations  occur  most  frequently  in  clus¬ 
ters  (11).  We  have  attributed  mutational  clustering  to  a  proces¬ 
sive  bidirectional  sliding  of  AID  along  ssDNA  (10,  11).  Muta¬ 
tions  initiated  in  hot  spot  motifs  lead  to  additional  nearby 
mutations,  thus  amplifying  differences  between  WT  and 
mutant  AID  spectra  in  R^  and  R4  (Fig.  3b,  A  difference  spectra). 

Favored  deaminations  occurring  in  R2  for  the  phosphoryla¬ 
tion-null  AID  mutants  (Fig.  3b)  appear  to  have  a  more  subtle 
origin  that  may  reflect  differences  in  individual  intermediate 
motif  deamination  efficiencies.  The  phosphorylation-null 
mutants  are  much  more  apt  than  WT  AID  to  attack  two  inter¬ 
mediate  motifs,  CGC  and  GGC  (Table  2).  The  possible  rele¬ 
vance  of  this  observation  is  that  CGC  and  GGC  appear  five  and 
four  times,  respectively,  in  R2,  each  having  a  mutability  index 
favoring  mutant  over  WT  deaminations  by  factors  of  2.7  {p  = 
0.004)  and  3.4  (p  =  0.0001),  respectively,  whereas  ACC,  which 
is  the  favored  intermediate  motif  for  WT  relative  to  mutant 
AID  (Table  2,  MI  ratio  —3.6,  p  =  0.02),  is  absent  from  R2.  A 
clustering  of  mutations  proximal  to  the  intermediate  motifs 
could  amplify  differences  that  would  favor  mutant  AID-cata¬ 
lyzed  deaminations  in  R2. 

Studies  of  AID  mutations  (T27A  and  S38A)  have  led  to  a 
suggestion  that  phosphorylation  of  AID  in  vivo  plays  an  impor¬ 
tant  role  in  CSR  and  SHM.  But  what  that  role  is  has  yet  to  be 
established.  Since  phosphorylated  AID  is  preferentially 
enriched  in  the  chromatin  fraction  (23),  perhaps  phosphoryla¬ 
tion  is  needed  for  transporting  AID  from  the  cytoplasm  into  the 
nucleus,  or  to  target  AID  to  actively  transcribed  V-genes  and 
switch  regions.  Whatever  the  role  turns  out  to  be,  it  is  essential 
to  determine  how  phosphorylation  modulates  the  biochemical 
properties  of  AID.  Complicating  matters,  however,  is  that 
zebrafish  AID,  which  has  no  equivalent  Ser^^  residue,  is  fully 
competent  in  both  CSR  and  SHM  (26,  28,  29).  Therefore,  it  is 
imperative  to  distinguish  between  effects  attributable  to  the 
absence  of  phosphorylation  at  a  specific  Ser  amino  acid  residue 
versus  replacement  of  the  Ser  residue  by  a  different  amino  acid 
{e.g  Ala  or  Pro). 

Our  data  show  that  the  biochemical  characteristics  of  AID, 
including  specific  activity,  processivity,  and  transcription-de- 
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pendent  deamination,  are  not  affected  either  by  the  absence  of 
phosphorylation  on  serine  residues  38,  41,  and  43  or  by  excess 
phosphorylation.  The  replacement  of  each  of  these  residues, 
however,  results  in  a  significant  change  in  mutant  versus  WT 
AID  deamination  spectra.  The  AID  mutants  show  a  high  level 
of  deamination  notably  at  two  non-hot  spot  motifs,  GGC  and 
CGC,  where  WT  AID  is  relatively  inactive  (Table  2). 

Switch  region  DNA  sequences  contain  remarkably  high  fre¬ 
quencies  of  GGC  motifs.  There  are  196  GGCs  in  the  S-region  of 
IgG4  and  126  in  the  S-region  of  IgE,  which  are  by  far  the  most 
prevalent  triplet  motifs  in  these  two  switch  regions,  comprising 
'-15%  in  IgG  and  —19%  in  IgE  (Table  3).  Since  WT  AID  deami¬ 
nation  spectra  measured  on  ssDNA  and  on  transcribed  dsDNA 
in  vitro  agree  well  with  C  ^  T  SHM  spectra  in  humans  (10, 11), 
by  analogy  it  seems  possible,  perhaps  even  likely,  that  S43  A  and 
S43P  mutant  AID  might  introduce  high  levels  of  U  at  GGC 
switch  region  motifs.  Thus,  if  aberrant  recombination  were  to 
result  from  excess  deamination  targeted  to  noncanonical 
switch  region  motifs,  that  might  contribute  to  the  delay  in 
S38A-initiated  CSR  in  cultured  B  cells  (23)  and  perhaps  to  the 
elimination  of  CSR  in  HIGM-2  patients  having  a  S43P  mutant 
form  of  AID  (32). 

In  a  general  sense,  we  have  observed  that  the  motifs  that  WT 
AID  favors  in  vitro  are  strikingly  similar  to  hot  spot  motifs  of 
SHM  in  vivo  (10),  although  the  enzyme  must  contend  with 
chromatin,  transcription,  and  assuredly  with  AID-associated 
proteins  in  B  cells  (3-5).  The  ability  to  simulate  biochemically 
several  important  biological  functions  of  AID  suggests  that 
many  characteristics  of  SHM  and  CSR  are  determined  by  its 
intrinsic  properties.  Our  data  suggest  the  prospect  that  altered 
deamination  specificity  caused  by  amino  acid  replacements  at 
Ser^^,  Ser'^^,  or  Ser'^^  could  perhaps  account  for  significant  defi¬ 
ciencies  in  SHM  and  CSR  in  B-cells. 
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APOBEC3G  (A3G)  restricts  HIV-l  infection  by  catalyzing 
processive  C  — >  U  deaminations  on  single-stranded  DNA 
(ssDNA)  with  marked  3'  — >  5'  deamination  polarity.  Here  we 
show  that  A3G  exists  in  oligomeric  states  whose  composition  is 
dictated  primarily  by  interactions  with  DNA,  with  salt  playing 
an  important,  yet  secondary,  role.  Directional  deaminations 
correlate  with  the  presence  of  dimers,  tetramers,  and  larger  oli¬ 
gomers  observed  by  atomic  force  microscopy,  and  random 
deaminations  appear  to  correlate  mainly  with  monomers.  The 
presence  of  a  30-nt  weakly  deaminated  “dead”  zone  located  at  the 
3'-ssDNA  end  implies  the  presence  of  a  preferred  asymmetric 
direction  for  A3G  catalysis.  Single  turnover  reaction  rates  reveal  a 
salt-dependent  inhibition  of  C  deamination  toward  the  3'-ssDNA 
region,  offering  a  molecular  basis  underlying  A3G  deamination 
polarity.  Presteady  state  analysis  demonstrates  rapid  diffusion- 
limited  A3G-ssDNA  binding,  a  slower  salt-dependent  conforma¬ 
tional  change,  possibly  indicative  of  DNA  wrapping,  and  long 
(5-15  min)  protein-DNA  complex  lifetimes.  We  suggest  that 
diverse  A3G  oligomerization  modes  contribute  to  the  human 
immunodeficiency  virus,  type  1,  proviral  DNA  mutational  bias. 


In  2002,  Sheehy  et  al  (1)  determined  that  APOBEC3G 
(A3G),  originally  called  GEM  15,  a  proposed  cytidine  deaminase 
based  on  sequence  analysis,  is  the  nonpermissive  host  factor 
that  blocks  virion  infectivity  factor-defective  {Isvij)  HIV-1 
infection  of  T  cells.  The  experimental  determination  that 
CEM15  was  a  cytidine  deaminase  belonging  to  the  APOBEC 
family  followed  soon  after  with  experiments  demonstrating 
G  — >  A-induced  hypermutation  of  proviral  DNA  in  a  Av;/ 
HIV-1  virion  (2,  3).  A3G  has  a  duplicated  deaminase  domain 
structure,  but  only  the  C-terminal  domain  is  responsible  for  the 
single-stranded  DNA  (ssDNA)^  deamination  activity  (4,  5). 
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force  microscopy;  dsDNA,  double-stranded  DNA;  HIV,  human  immuno¬ 
deficiency  virus;  nt,  nucleotide(s). 


Apart  from  A3G-catalyzed  deamination,  A3G  may  also  have 
the  capacity  for  blocking  reverse  transcription,  (+)-DNA  syn¬ 
thesis,  and  provirus  formation  either  by  interacting  with  RNA 
or  DNA  of  HIV-1  (6  -  8)  or  by  possibly  interacting  with  HIV-1 
proteins  (9,  10).  However,  these  noncatalytic  effects  on  HIV 
inhibition  may  be  attributable  to  the  overexpression  of  A3G 
and  may  not  be  occurring  during  normal  infection  (11-13). 
It  is  important  to  bear  in  mind  that  any  actions  of  A3G  on 
DNA,  catalytic  and  possibly  noncatalytic,  are  balanced  in 
vivo  against  cellular  RNA  binding,  which  forms  a  high 
molecular  mass  A3G-RNA  complex,  which  may  prevent 
A3G  incorporation  into  virions  (7,  14-16),  and  against  HIV 
RNA  binding,  which  forms  an  intravirion  A3G  complex  in 
which  A3G  must  be  activated  by  HIV  RNase  H  for  DNA 
deamination  to  ensue  (16). 

We  have  shown  that  A3G -catalyzed  deamination  occurs 
processively  while  exhibiting  a  3'  — >  5'  polarity  favoring  deami¬ 
nation  toward  the  5'-region  of  ssDNA  (15).  Directional  deami¬ 
nation  is  an  intrinsic  property  of  A3G,  occurring  in  the  absence 
of  an  obvious  source  of  energy  {e,g.  ATP  or  GTP)  (15)  and  can  in 
principle  contribute  to  the  HIV-1  G  — >  A  mutational  bias, 
increasing  in  a  3 '-direction  in  the  RNA  genome  (17,  18).  A3G 
processivity  appears  consistent  with  a  three-dimensional  facil¬ 
itated  diffusion  process  involving  sliding  and  jumping  motions 
along  the  DNA  (15),  where  jumping  involves  microscopic  dis¬ 
sociation  and  reassociation  events  occurring  with  the  same 
DNA  molecule,  rather  than  diffusion  into  the  surrounding  bulk 
solution  (19-21).  Three-dimensional  processive  movement 
has  been  documented  for  a  variety  of  site-specific  DNA-target- 
ing  proteins  {e,g,  restriction  endonucleases)  (21-24),  but  in 
contrast  to  A3G,  catalysis  along  the  DNA  is  random.  Random 
processive  deaminations  also  occur  with  activation-induced 
cytidine  deaminase  (AID),  an  APOBEC  family  member  with  a 
single  deaminase  domain  (25). 

We  have  previously  suggested  that  A3G  deamination  is 
strongly  favored  toward  the  5 '-direction,  perhaps  because  of 
a  catalytic  orientation  specificity  and  structural  asymmetries 
in  the  enzyme  (15).  The  presence  of  multiple  catalytic  and 
binding  domains  in  A3G  contained  in  an  elongated  rod¬ 
shaped  oligomeric  structure  found  in  this  deaminase  family 
(26,  27)  has  the  potential  to  impose  catalytic  directionality. 
A3G  has  been  structurally  characterized  in  solution  as  a  tail- 
to-tail  dimer  (26),  but  monomeric  forms  are  also  active  (28  - 
30).  When  bound  to  DNA,  A3G  may  exist  in  different  oligo¬ 
meric  states,  as  implied  by  multiple  A3G-DNA  bands  in  gel 
shift  analyses  (15,  31). 
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In  the  present  work,  using  human  A3G  produced  from  bacu- 
loviruS'infected  Sfi  insect  cells,  we  have  applied  presteady  state 
kinetic  analyses  in  combination  with  atomic  force  microscopy 
(AFM)  to  investigate  the  dynamic  and  structural  determinants 
of  A3G  processivity  and  catalytic  polarity.  We  provide  evidence 
showing  that  the  3'  — >  5'  bias  in  deamination  of  C  — >  U  appears 
to  be  determined  by  the  oligomeric  state  of  the  enzyme,  with 
populations  of  dimers,  tetramers,  and  perhaps  larger  oligomers 
favoring  directionally  biased  deamination  and  most  likely 
monomers  catalyzing  random  deamination.  Both  the  direc¬ 
tional  and  random  deamination  modes  occur  with  high  proces¬ 
sivity,  readily  observable  on  a  time  scale  of  a  few  seconds. 

EXPERIMENTAL  PROCEDURES 

Substrates — DNA  oligonucleotides  were  synthesized  on  an 
Applied  Biosystems  3400  DNA/RNA  synthesizer  and  are  listed 
in  supplemental  Table  SI.  Fluorescein-dT  and  pyrrolo-dC  were 
purchased  from  Glen  Research.  The  partially  dsDNA  substrates 
were  constructed  by  annealing  with  a  1.4-fold  molar  excess  of 
complementary  ssDNA.  A  circular  DNA  substrate  was  con¬ 
structed  by  5 '-end  phosphorylation  of  a  118-nt  ssDNA  sub¬ 
strate  with  T4  polynucleotide  kinase  (New  England  Biolabs) 
and  subsequent  ligation  using  a  linker  DNA  oligonucleotide. 
The  circular  product  was  then  purified  after  resolution  on  a 
10%  denaturing  polyacrylamide  gel. 

A3G  and  AID  Expression  and  Purification— K  baculovirus- 
expressed  GST-A3G  protein  was  constructed  as  described  pre¬ 
viously  (15).  A3G  recombinant  virus  was  obtained  as  described 
previously  (32).  Sfi  cells  were  infected  with  recombinant  A3G 
virus  at  a  multiplicity  of  infection  of  1  and  harvested  after  72  h. 
Cells  were  lysed  as  described  previously  (15)  in  the  presence  of 
50  jULg/ml  RNase  A.  Cleared  lysates  were  then  incubated  with 
glutathione-Sepharose  resin  (GE  Healthcare)  and  subjected  to  a 
series  of  salt  washes  (0.25-1  m  NaCl)  before  elution  in  buffer  as 
previously  described  (15).  A3G  was  treated  with  0.02  units/ jutl 
thrombin  (GE  Healthcare)  for  16  h  at  21  °C  to  release  the  glu¬ 
tathione  ^'-transferase  tag,  and  subsequently  the  sample  was 
diluted  to  50  mM  NaCl  and  loaded  onto  a  DEAF  FF  column  (GE 
Healthcare)  equilibrated  with  50  mM  Tris-HCl,  pH  8.9,  50  mM 
NaCl,  1  mM  dithiothreitol,  and  10%  glycerol.  A3G  was  eluted  at 
—200  mM  NaCl  by  using  a  linear  gradient  from  75  to  1000  mM 
NaCl.  Fractions  were  collected  and  stored  at  —  70°C.  A3G  is 
—95%  pure.  AID  was  purified  as  previously  described  (32). 

Deamination  Assays — A3G  activity  on  ssDNA  substrates  was 
measured  in  reactions  containing  50  nM  ^^P-labeled  DNA,  5  or 
500  nM  A3G  in  buffer  (50  mM  HEPES,  pH  7.3, 1  mM  dithiothre¬ 
itol,  and  varied  amounts  of  MgCl2  or  other  salts  where  indi¬ 
cated)  and  incubated  for  2.5-  60  min  at  37  °C.  ssDNA  sub¬ 
strates  (0.3- 0.5  pu)  with  an  internal  fluorescein  label  were 
incubated  with  A3G  or  AID  (20  -300  um),  and  deaminations 
were  carried  out  as  described  above.  All  reactions  with  partial 
dsDNA  substrates  also  contained  30  mM  NaCl  to  stabilize  the 
duplex  region.  Reactions  were  terminated  by  an  extraction  with 
phenol/chloroform/isoamyl  alcohol  (25:24:1),  and  deamina¬ 
tions  were  detected  by  the  uracil  DNA  glycosylase  assay,  as 
described  previously  (15).  Circular  substrates  were  linearized 
before  PAGE  resolution  by  annealing  to  a  30-nt  complementary 
oligonucleotide  containing  an  Msel  site  and  incubating  with  5 


units  of  Msel  for  60  min  at  37  °C.  Gel  band  intensities  were 
visualized  by  phosphorimaging  or  with  a  EX  fluorescence  scan¬ 
ner  (Bio-Rad)  and  then  measured  with  ImageQuant  software 
(Amersham  Biosciences). 

Presteady  state  reactions  were  mechanically  sampled  and 
quenched  using  a  KinTek  quench-flow  instrument.  A  solution 
of  DNA  and  A3G  at  twice  the  final  intended  concentration  was 
rapidly  mixed  in  equal  volumes  and  allowed  to  react  for  a  given 
amount  of  time  (0.1-180  s)  before  quenching  with  phenol/ 
chloroform/isoamyl  alcohol  (25:24:1).  Samples  were  then 
treated  as  described  above. 

Analysis  ofProcessive  Deamination  by  a  Single  A3G  or  AID — 
The  analysis  of  processive  deamination  described  previously 
(15,  25)  was  used.  This  entails  the  quantification  of  integrated 
band  intensity  of  fractions  of  substrates  with  a  single  deamina¬ 
tion  at  the  5'  C  or  3'  C  or  a  double  deamination  at  both  the  5'  C 
and  3'  C.  The  predicted  fraction  of  independent  double  deami¬ 
nations  is  then  calculated  by  multiplying  the  fraction  of  all 
deaminations  occurring  at  the  5'  C  (sum  of  the  integrated  gel 
band  intensities  from  deaminations  occurring  at  5'  C  and  dou¬ 
ble  deaminations  (5'  C  and  3'  C))  to  the  fraction  of  all  deami¬ 
nations  occurring  at  the  3'  C  (sum  of  the  integrated  gel  band 
intensities  from  deaminations  occurring  at  3'  C  and  double 
deaminations  (5'  C  and  3'  C)).  The  ratio  of  the  observed  frac¬ 
tion  of  double  deaminations  (occurring  at  both  5'  C  and  3'  C  on 
the  same  molecule)  to  the  predicted  fraction  of  independent 
double  deaminations  is  the  “processivity  factor”  and  was  used 
to  determine  processive  {Le.  correlated  double)  deamination  of 
two  target  motifs.  A  ratio  larger  than  1  indicates  that  a  majority 
of  double  deaminations  are  caused  by  the  same  A3G  or  AID 
molecule  acting  processively  on  both  target  motifs. 

Presteady  State  Anisotropy  and  Fluorescence  Assays — Reac¬ 
tions  were  at  37  °C  in  buffer  containing  50  mM  HEPES,  pH  7.3, 
1  mM  dithiothreitol,  and  0  or  5  mM  MgCl2. 

Stopped  flow  anisotropy  measurements  were  performed 
using  an  Applied  Photophysics  H'TSO  spectrofluorometer 
equipped  for  anisotropy  by  exciting  with  vertically  polarized 
light  at  494  nm  (2.5-nm  slit  width)  from  a  xenon  lamp  and 
monitoring  emissions  using  a  515-nm  cut-off  filter.  To  deter¬ 
mine  on-rates,  0.8  -2  pu  A3G  was  incubated  in  one  syringe, 
and  200  nM  fluorescein-labeled  69-nt  ssDNA  substrate  was 
incubated  in  a  second  syringe,  before  rapidly  mixing  50  pi  from 
each  syringe  and  monitoring  the  change  in  anisotropy  for  0.2  s. 
The  anisotropy  change  as  a  function  of  time  was  fit  to  a  double 
exponential  equation  to  obtain  on-rates.  To  determine  off- 
rates,  in  one  syringe,  100  nM  fluorescein-labeled  69-nt  ssDNA 
substrate  was  preincubated  with  400  nM  A3G.  The  second 
syringe  contained  10  or  20  pu  unlabeled  ssDNA  substrate.  The 
reaction  was  initiated  by  mixing  equal  volumes  of  the  syringes 
(50  pi)  and  monitoring  the  change  in  anisotropy  for  500-1000  s. 
The  anisotropy  change  as  a  function  of  time  was  fit  to  a  double 
exponential  decay  to  obtain  off-rates. 

Stopped  flow  fluorescence  measurements  were  performed 
using  an  Applied  Photophysics  n'T80  spectrofluorometer 
equipped  for  fluorescence.  The  ssDNA  containing  pyrrolo-C 
was  excited  at  335  nm  (2-nm  slit  width)  with  a  xenon-mercury 
lamp,  and  emissions  were  monitored  using  a  435-nm  cut-off 
filter.  To  determine  fluorescence  changes,  measured  in  volts. 
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FIGURE  1 .  A3G  deamination  activity,  polarity,  and  processivity  on  ssDNA  is  dependent  on  l\/lgCl2  and  the 
presence  of  a  dsDNA  region.  AandB,  specific  activity  of  A3G  (5  nivi)  is  plotted  as  a  function  of  the  distance  (c/) 
of  the  CCC  motif  from  the  3'-end  of  a  69-nt  ssDNA  substrate  (50  nivi)  deaminated  in  the  presence  of  5  mM  MgCl2 
{A)  or  absence  of  MgCl2  {B).  Error  bars,  ±  1  S.E.  from  three  independent  experiments.  Cand  D,  A3G  deamination 
of  a  1 1 8-nt  internally  fluorescein-labeled  ssDNA  substrate  at  an  enzyme/DNA  ratio  of  1 :20  in  the  absence  (C)  or 
presence  (D)  of  a  20-nt  complementary  oligonucleotide  annealed  between  the  two  C  targets.  Single  deami¬ 
nations  of  the  5'  C  and  3'  C  are  detected  as  the  appearance  of  labeled  1 00-  and  81  -nt  fragments,  respectively; 
double  deamination  of  both  Cs  on  the  same  molecule  results  in  a  63-nt  labeled  fragment  (5'  C  and  3'  C). 
Substrate  usages  (%),  processivity  factors,  and  ratios  of  5'  C/3'  C  deaminations  are  shown  below  the  gels.  For 
deamination  reactions  with  partially  dsDNA,  30  mM  NaCI  was  present  to  stabilize  the  duplex  region.  The 
reaction  on  ssDNA  shown  in  C  is  unaffected  by  the  presence  of  30  mM  NaCI. 


2  fjLM  A3G  was  incubated  in  one  syringe,  and  200  nM  pyrrolo- 
C'labeled  69'nt  ssDNA  substrate  was  incubated  in  a  second 
syringe,  before  rapidly  mixing  50  jutl  from  each  syringe  and 
monitoring  the  change  in  fluorescence  emission  for  0.6  s.  The 
fluorescence  change  as  a  function  of  time  was  fit  to  a  double 
exponential  equation  to  obtain  rates. 

Atomic  Force  Microscopy — A3G  imaging  buffer  (50  mM 
HEPES,  pH  7.3)  either  with  or  without  5  mM  MgCl2  was  heated 
to  65  °C  and  allowed  to  cool  slowly  to  dissolve  salts.  A3G  was 
then  diluted  to  100  nM  in  A3G  imaging  buffer,  and  0.1  mM 
dithiothreitol  (final  concentration)  was  added  to  the  sample. 
Ten  microliters  of  the  protein  sample  were  then  deposited  onto 
freshly  cleaved  mica  (Spruce  Pine  Mica  Company,  Spruce  Pine, 
NC).  The  sample  was  rinsed  immediately  with  nanopure  water, 
excess  water  was  wicked  from  the  surface  using  filter  paper,  and 
the  surface  was  then  dried  using  a  stream  of  nitrogen  before 
being  placed  in  the  AEM  for  imaging.  Eor  experiments  done  in 
the  presence  of  DNA  substrate,  200  nM  (final  concentration)  of 
ssDNA  substrate  was  added  to  the  A3G  sample  before  deposit¬ 
ing  as  for  the  protein  alone. 

All  AEM  images  were  captured  in  air  using  either  a  Nano¬ 
scope  III  or  Ilia  (Digital  Instruments,  Santa  Barbara,  CA) 
microscope  in  tapping  mode.  Pointprobe  Plus  tapping  mode 
silicon  probes  (Agilent,  Tempe,  AZ)  with  resonance  frequen¬ 
cies  of  '■--170  kHz  were  used  for  imaging.  Images  were  collected 


at  a  speed  of  2-3  Hz  with  an  image 
size  of  1  X  1  pm  at  512  X  512  pixel 
resolution.  Each  experiment  was 
repeated  at  least  twice.  Volume 
analysis  was  done  as  previously 
described  (33,  34).  The  program 
Kaleidagraph  (Synergy  Software, 
Reading,  PA)  was  used  to  generate 
statistical  plots. 

RESULTS 

Electrostatic  Influence  on  A3G  De- 
amination  Polarity  and  Processivity — 
We  previously  observed  that  A3G 
deamination  on  linear  ssDNA  exhib¬ 
its  a  marked  deamination  bias,  where 
the  rate  of  deaminating  the  third  C  in 
the  5' -CCC  target  to  U  is  enhanced 
significantly  as  the  motif  is  located 
further  away  from  the  ssDNA  3 '-end 
(15).  The  earlier  experiments  were 
performed  in  the  presence  of  salt 
(MgCl2)  (15).  To  study  A3G  deamina¬ 
tion  bias  at  higher  resolution,  with 
and  without  salt,  we  used  11  ssDNA 
substrates,  each  with  a  5'-A(G/T)A 
CCC  AAA  hot  spot  motif  in  a  differ¬ 
ent  location  along  a  69-nt  ssDNA 
molecule  (Eig.  1,  A  and^). 

A  “dead”  zone  containing  30  nt 
was  present  at  the  3 '-end  of  the 
ssDNA  substrate,  where  the  specific 
activity  was  <1  pmol/jutg/min  irre¬ 
spective  of  target  position  or  salt  concentration  (Eig.  1,  A  and 
^).  We  speculate  that  the  30-nt  dead  zone  may  be  the  optimal 
space  A3G  needs  at  a  3 '-ssDNA  end  to  encounter  a  CCC  motif 
in  an  orientation  allowing  catalysis.  There  was  a  monotonic 
increase  in  A3G  specific  activity,  in  the  presence  of  5  mM 
MgCl2,  as  the  target  motif  was  moved  out  of  the  dead  zone 
toward  the  5 '-end,  achieving  a  maximum  specific  activity  of 
11  pmol/jULg/min  (Eig.  lA).  In  the  absence  of  MgCl2  the  activ¬ 
ity  of  A3G  reached  a  peak  near  the  center  of  the  ssDNA  (--15 
pmol/jULg/min)  and  then  remained  roughly  constant  before 
decreasing  from  about  50  to  69  nt  from  the  3 '-end  (0  mM 
MgCl2)  (Eig.  1^). 

A3G  processivity  and  deamination  polarity  can  be  measured 
simultaneously  by  placing  two  CCC  motifs  on  the  same  ssDNA 
substrate  and  having  a  fluorescein  label  in  between  (15, 25)  (Eig. 
1C).  The  processivity  factor  (see  “Experimental  Procedures”) 
measures  the  fraction  of  correlated  double  deaminations 
occurring  at  two  CCC  motifs  located  on  the  same  ssDNA  sub¬ 
strate  by  a  single  enzyme  molecule  compared  with  uncorrelated 
double  deaminations  catalyzed  independently  by  two  different 
A3G  molecules  (15,  25).  The  analysis  requires  that  less  than 
'-15%  of  the  substrate  be  used  to  maintain  single-hit  kinetics 
(35).  A3G  processivity  did  not  change  significantly  as  a  function 
of  MgCl2  concentration,  where  the  processivity  factors  are  in 
the  range  of  5-9  (Pig.  1C).  A  measurement  of  A3G  deamination 
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FIGURE  2.  A3G  directional  bias  is  absent  on  circular  ssDNA  and  present  on 
partially  double-stranded  circular  DNA.  A3G  deamination  bias  and  proces¬ 
sivity  were  characterized  on  circular  fluorescein-labeled  DNA  substrates  con¬ 
taining  two  CCC  motifs  using  an  enzyme/DNA  ratio  of  1 :20.  A  {sketch),  deami¬ 
nation  events  on  a  118-nt  circular  ssDNA  are  visualized  after  a  30-nt 
complementary  oligonucleotide  is  annealed  to  the  circular  DNA  to  enable 
cutting  by  Msel  {sketch).  This  procedure  linearizes  the  circular  DNA  so  that  all 
possible  single  and  double  deamination  events  can  be  detected  using  the 
uracil  DNA  glycosylase  (UDG)  assay  (see  "Experimental  Procedures").  An  X 
represents  an  abasic  site.  A,  A3G  exhibits  processive  but  not  directionally 


polarity  on  the  two  C  targets,  which  are  out  of  the  deamination 
dead  zone,  as  a  function  of  salt  concentration,  shows  that 
deamination  at  the  5'  C  target  (100  nt  from  the  3 '-end)  is 
favored  over  the  3'  C  target  (36  nt  from  the  3 '-end)  by  roughly 
2-fold  at  MgCl2  concentrations  of  >5  mM  (5'  C/3'  C  deamina¬ 
tion  ratio;  Fig.  1C).  The  deamination  polarity  decreased  with 
less  salt  until,  in  the  near  absence  of  MgCl2,  A3G  deaminated 
the  C  targets  with  no  preference  (5'  C/3'  C  deamination  ratio 
~1;  Fig.  1C). 

The  directional  bias  of  A3G  deamination  was  clearly  electro¬ 
static  in  origin  as  opposed  to  a  specific  interaction  with  Mg^^, 
because  favored  deamination  toward  the  5 '-end  also  occurred 
with  CaCl2,  MnCl2,  and  NaCl  (supplemental  Fig.  SI).  In  the 
case  of  monovalent  NaCl,  a  20-fold  higher  concentration  was 
required  to  obtain  a  similar  5 '-end  deamination  bias  observed 
for  divalent  salts. 

We  reexamined  the  polar  and  processive  properties  of  A3G 
on  the  118-nt  ssDNA  (Fig.  1C)  but  with  a  20-nt  oligonucleotide 
annealed  between  the  deamination  targets  (Fig.  ID).  Since  A3G 
bound  dsDNA  with  a  low  GC  content  at  5-fold  lower  affinity 
than  ssDNA  (supplemental  Fig.  S2)  and  does  not  bind  dsDNA 
with  a  high  GC  content  (15),  a  loss  in  the  sliding  component  of 
the  processive  motion  is  expected  (15,  31).  However,  the  jump¬ 
ing  component  of  processivity  appeared  to  be  retained,  since 
the  processivity  factor  was  reduced  from  8  to  4  (Fig.  ID).  The 
lack  of  deamination  bias  in  the  absence  of  MgCl2  (5'  C/3'  C 
deamination  ratio  is  1;  Fig.  ID)  was  expected  (see  Fig.  1C,  0  mM 
MgCl2).  However,  the  annealed  dsDNA  region  between  the  two 
deamination  targets  also  resulted  in  a  loss  of  deamination  bias 
in  the  presence  of  5  mM  MgCl2  (5'  C/3'  C  deamination  ratio  is 
'-I;  Fig.  1,  compare  C  and  D),  indicating  that  the  dsDNA 
between  the  deamination  sites  decoupled  the  two  sides  of  the 
substrate.  The  deamination  in  each  single-stranded  region 
depended  on  the  target  size  for  each  region,  the  36-nt  target 
length  from  the  3 '-end  of  the  118  nt  substrate  and  the  33-nt 
target  length  region  between  the  5'  C  and  the  dsDNA  (Fig.  ID). 
There  is,  however,  an  important  distinction  to  be  made  regarding 
the  occurrence  of  the  dead  zone  (Fig.  1,  A  and  5).  The  ~  30-nt  dead 
zone  occurs  only  at  the  3 '-end  of  the  ssDNA;  there  is  no  internal 
dead  zone  created  by  the  dsDNA  region,  but  instead  deamination 
is  dependent  of  the  nucleotide  target  size  (15). 

Deamination  Polarity  on  Circular  ssDNA  Requires  an  Inter¬ 
vening  dsDNA  Region — To  examine  what  effect  the  ssDNA 
ends  have  on  A3G  deamination  bias,  ends  were  eliminated  by 
circularizing  the  molecule  depicted  in  Fig.  1C  (Fig.  2A,  sketch). 
A3G  shows  essentially  no  deamination  bias  either  with  or  with¬ 
out  MgCl2  on  the  circular  molecule  (81  nt/100  nt  deamination 
ratio  '~1.1-1.3;  Fig.  2A).  Although  deamination  was  slightly 
inhibited  in  the  presence  of  MgCl2,  the  apparent  binding  con¬ 
stants  of  the  circular  DNA  compared  with  its  linear  counterpart 


biased  deamination  on  circular  DNA  as  a  function  of  MgC^.  B,  A3G  deamina¬ 
tion  of  circular  substrate  in  the  presence  of  a  20-nt  complementary  oligonu¬ 
cleotide  annealed  between  the  two  C  targets  is  processive,  and  the  direc¬ 
tional  bias  is  restored  with  5  mM  MgCl2.  Substrate  usages  (%),  processivity 
factors,  and  ratios  of  81  -nt  fragment  to  1 00-nt  fragment  are  shown  belowthe 
gels.  For  deamination  reactions  with  partially  dsDNA,  30  mM  NaCl  was  present 
to  stabilize  the  duplex  region.  The  reaction  on  circular  ssDNA  shown  in  A  is 
unaffected  by  the  presence  of  30  mM  NaCl. 
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were  the  same  {K^  160  nu).  A3G  was  processive  on  the  cir¬ 

cular  substrate  with  processivity  factors  similar  to  those  on  the 
linear  substrate  (compare  Fig.  2A  with  Fig.  1C). 

We  examined  the  effect  on  deamination  of  an  oligonucleo¬ 
tide  annealed  to  the  circular  DNA  substrate  to  form  a  partially 
dsDNA  circle  (Fig.  2B),  To  ensure  that  A3G  had  enough  room 
to  deaminate  both  C  targets  equally  (see  Fig.  1^),  we  placed  the 
dsDNA  at  least  33  nt  from  the  3 '-side  of  each  CCC  motif.  A3G 
was  able  to  act  processively  on  partially  dsDNA  circles  (proces¬ 
sivity  factor  between  5  and  11;  Fig.  2B).  Without  MgCl2,  A3G 
deaminated  each  C  target  with  about  equal  preference  (81 
nt/100  nt  deamination  ratio  of  1.3;  Fig.  2B),  However,  with  5 
mM  MgCl2,  A3G  regained  the  ability  to  deaminate  with  a  direc¬ 
tional  bias  (81  nt/100  nt  deamination  ratio  of  5;  Fig.  2B), 

The  dsDNA  segment,  in  the  presence  of  MgCl2,  may  rein¬ 
state  the  deamination  bias  by  limiting  the  available  nucleotide 
target  size.  That  A3G  now  favors  the  C  target  with  the  largest 
target  size  for  binding  {Le.  that  furthest  from  the  3 '-end  of  the 
dsDNA  segment)  (87  nt;  Fig.  2B)  implies  that  there  is  an  intrin¬ 
sic  3'  5'  directional  catalytic  constraint  for  A3G  deamination 

so  that  deamination  occurs  predominantly  when  A3G 
approaches  a  C  target  with  a  3'  — >  5'  orientation  on  ssDNA, 
consistent  with  our  previous  data  (15).  A3G  showed  no  deami¬ 
nation  bias  on  a  completely  single-stranded  circle  (Fig.  2A), 
presumably  because  the  enzyme  can  encounter  either  of  the 
two  C  targets  in  a  3'  — >  5'  orientation  without  a  nucleotide 
target  size  limitation  imposed  by  linear  ssDNA  ends.  A3G  did 
not  use  a  terminal  3' -OH  for  directional  orientation  (supple¬ 
mental  Fig.  S3).  Deamination  assays  with  a  biotin/streptavidin 
block  at  the  3'  -  or  5 '-end  showed  no  difference  in  the  deamina¬ 
tion  activity  or  directional  bias,  which  suggests  that  preferential 
binding  at  an  ssDNA  end  is  probably  not  occurring  (data  not 
shown). 

Temporal  Analysis  of  A3G  Deamination  Polarity  and 
Processivity — To  make  a  finer  tuned  analysis  of  the  two  deami¬ 
nation  modes  of  A3G,  directionally  biased  or  random,  we  used 
rapid-quench  kinetics  to  investigate  deamination  polarity  and 
processivity  on  short  time  scales  (several  seconds).  We  had  pre¬ 
viously  established  that  during  “long”  (10-min)  incubations, 
A3G-catalyzed  C  deaminations  occur  processively  and  with 
3'  — >  5'  polarity  in  the  presence  of  salt  (15)  (Fig.  1C).  Notably, 
the  same  degree  of  processivity  and  deamination  polarity  in  the 
presence  of  salt  was  observed  during  “short”  incubations  (Fig.  3, 
A  and  B).  The  reactions  with  5  mM  MgCl2  (Fig.  3A)  show  that  at 
the  earliest  time  that  deaminations  are  detectable  (~5  s), 
deamination  of  the  5'  C  is  favored  over  the  3'  C  by  a  factor  of  9. 
In  the  absence  of  MgCl2,  A3G  deaminated  each  C  target  with  no 
preference;  the  “3'  C”  was  36  nt  from  the  3 '-end  and  was  fully 
accessible  for  deamination  (Fig.  SB),  The  processivity  factors 
decreased  substantially,  approaching  '-I  (no  processivity)  as 
substrate  usage  went  from  4  to  >50%  (Fig.  3,  A  and  B).  The  loss 
of  processivity  at  high  substrate  usage  probably  resulted  from 
multiple  encounters  of  A3G  with  the  ssDNA  substrates,  leading 
to  independent  double  deaminations  (15),  whereas,  at  low  sub¬ 
strate  usage,  each  ssDNA  substrate  is  expected  to  encounter 
A3G  only  once,  giving  rise  to  processive  correlated  double 
deaminations. 


To  characterize  the  mechanism  of  deamination  polarity 
and  to  obtain  A3G  single  turnover  deamination  rates  at  differ¬ 
ent  target  motif  locations  along  the  ssDNA  substrate,  we 
employed  three  of  the  69-nt  ssDNA  molecules  used  previously 
(see  Fig.  1,  A  and  B)  with  one  5'-A(G/T)A  CCC  AAA  hot  spot 
motif  either  57,  36,  or  27  nt  from  the  3' -DNA  end  in  reactions 
with  a  large  excess  of  A3G  over  substrate  DNA  (A3G/DNA 
10:1)  (Fig.  3,  CandD).  The  ssDNA  molecules  were  bound  by  A3G 
with  the  same  apparent  (100  nM)  (supplemental  Fig.  S4). 

The  single  turnover  rate  for  a  reaction  containing  5  mM 
MgCl2  for  a  C  target  57  nt  from  the  3 '-end  fits  well  to  a  single 
exponential  and  yields  a  deamination  rate  of  0.25/s  (Fig.  3C), 
with  a  maximum  of  ^^100%  substrate  deamination  within  20  s 
(Fig.  3C,  inset).  In  contrast,  the  C  targets  36  or  27  nt  from  the 
3 '-end  required  1500  and  3600  s  to  reach  100%  substrate 
deamination,  respectively  (Fig.  3C).  These  data  are  not  fit  well 
by  single  exponentials  and  require  double  exponential  fits  with 
a  fast  and  a  slow  phase.  For  ssDNA  with  a  C  target  near  the 
middle  (36  nt  from  the  3 '-end),  the  rate  of  the  fast  phase  is 
0.12/s  but  only  accounts  for  23%  of  the  deaminations  taking 
place  (Fig.  3C,  inset) y  whereas  the  slower  rate,  0.004/s  (one 
deamination  every  250  s)  accounts  for  77%  of  the  deamination 
(Fig.  3C).  The  C  target  in  the  ssDNA  3 '-end  dead  zone,  27  nt 
from  the  3 '-end,  was  deaminated  the  slowest,  with  a  fast  rate  of 
0.02/s  occurring  for  about  24%  of  the  deaminations  (Fig.  3C, 
inset)  and  a  slower  rate  of  0.0007/s  for  the  remainder  (Fig.  3C). 
Biphasic  deamination  rates  under  single  turnover  conditions 
suggest  that  multiple  deamination  modes  of  A3G  exist  that  cat¬ 
alyze  deaminations  at  different  rates. 

In  the  absence  of  MgCl2,  A3G  showed  greater  ability  to 
deaminate  at  the  center  of  the  ssDNA  (Fig.  3D),  which  is 
consistent  with  specific  activities  determined  at  various 
locations  along  the  DNA  substrate  (Fig.  1^).  The  deamina¬ 
tion  rates  for  substrates  with  C  targets  57  or  36  nt  from  the 
3 '-end  were  essentially  the  same,  with  deaminations  occur¬ 
ring  every  14  s  ('~0.08/s;  Fig.  3D,  inset).  On  the  substrate  with 
a  C  target  in  the  ssDNA  3 '-end  dead  zone,  27  nt  from  the 
3 '-end,  the  initial  rate  (0.03/s)  limited  deaminations  to 
^-30  s  for  each  and  was  far  slower  than  the  other  C  targets 
(Fig.  3D,  inset).  The  second  reaction  rate  representing  45%  of 
the  amplitude  for  a  C  target  located  27  nt  from  the  3 '-end 
was  0.001/s  (Fig.  3D). 

When  we  probed  deaminations  occurring  at  individual  C 
sites  within  each  motif,  it  was  evident  that  the  third  C  in  the 
5' -CCC  was  favored  for  deamination,  the  middle  C  (5' -CCC) 
could  also  be  deaminated  on  a  longer  time  scale,  and  the  first  C 
(5' -CCC)  was  barely  deaminated  (Fig.  3D).  The  general  pattern 
of  deaminations  within  a  motif  (Fig.  3D)  was  consistent  with 
A3G  3'  — >  5'  directional  polarity  in  which  triplet  motifs 
undergo  more  rapid  deamination  nearer  the  5 '-end  of  the 
ssDNA  substrate  (Fig.  3,  A  and  C). 

Rates  ofASG  Binding  and  Release  of  ssDNA  are  Biphasic — It 
is  essential  to  view  A3G  deamination  time  scales  (Fig.  3)  within 
the  broader  milieu  of  time  scales  required  for  A3G-ssDNA 
complex  formation  and  dissociation.  We  used  presteady  state 
kinetic  analyses  of  rotational  anisotropy  and  fluorescence 
emission  to  measure  A3G  on-rates  and  off-rates  and  to 
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FIGURE  3.  Presteady  state  analysis  of  polarity,  processivity  and  deamination  rates. Tt  and  B,  processive  deannination  occurs  on  a  presteady  state  time  scale. 
A3G  was  incubated  with  an  85-nt  internally  fluorescein-labeled  ssDNA  substrate  with  two  CCC  motifs  28  nt  apart  and  at  an  enzyme/DNA  ratio  of  1 :1 .  Single 
deaminations  of  the  5'  C  and  3'  C  are  detected  as  the  appearance  of  labeled  67-  and  48-nt  fragments,  respectively;  double  deamination  of  both  Cs  on  the 
same  molecule  results  in  a  30-nt  labeled  fragment  (5'  C  and  3'  C).  Processive  deamination  was  determined  in  the  presence  of  5  mM  MgCl2  {A)  and 
absence  of  IVlgCl2  {B).  Substrate  usages  (%),  processivity  factors,  and  the  ratios  of  5'  C/3'  C  deaminations  are  shown  below  the  gels.  C  and  D,  single 
turnover  reaction  rates  for  A3G  were  determined  on  ssDNA  substrates  with  CCC  motifs  57  (•),  36  (A),  or  27  nt  (■)  from  the  3'-end  of  a  69-nt  substrate 
under  enzyme  saturating  conditions  (500  nM  A3G,  50  nM  DNA).  C,  with  5  mM  MgCl2,  the  deamination  of  a  CCC  motif  57  nt  from  the  3'-end,  measured  as 
a  function  of  time,  was  fit  to  a  single  exponential  equation  to  determine  a  single  turnover  rate  of  0.25/s.  The  deamination  of  a  CCC  motif  36  nt  from  the 
3'-end  measured  as  a  function  of  time  was  fit  to  a  double  exponential  equation  to  determine  a  minor  "fast"  rate  component  of  0.1 2/s  (23%  amplitude) 
and  a  major  "slow"  rate  component  of  0.004/s  (72%  amplitude).  The  deamination  of  a  CCC  motif  27  nt  from  the  3'-end  measured  as  a  function  of  time 
was  fit  to  a  double  exponential  equation  to  determine  a  minor  "fast"  rate  component  of  0.02/s  (24%  amplitude)  and  a  major  "slow"  rate  component  of 
0.0007/s  (70%  amplitude).  Inset  graph,  the  first  60  s  of  the  reaction.  D,  reaction  rates  determined  as  in  C  but  in  the  absence  of  MgCl2.  The  data  for 
substrates  with  CCC  motif  57  and  36  nt  from  the  3'-end  were  fit  to  a  single  exponential  equation  to  obtain  rates  of  0.07/s  and  0.08/s,  respectively.  The 
data  for  substrate  with  CCC  motif  27  nt  from  the  3'-end  were  fit  to  a  double  exponential  equation  with  an  initial  "fast"  rate  of  0.03/s  (34%  amplitude)  and 
a  second  "slow"  rate  of  0.001 /s  (45%  amplitude).  Inset  graph,  the  first  60  s  of  the  reaction.  E,  examination  of  how  CCC  motifs  are  deaminated. 
Deamination  of  the  3'  and  middle  C  residues  in  the  motif  5'-CCC-3'  is  influenced  by  distance  from  the  3'-end  of  the  ssDNA.  Deamination  of  the  5'  C 
residue  in  the  motif  5'-CCC-3'  is  barely  observed.  The  gel  reaction  buffers  contained  5  mM  MgCl2. 


investigate  possible  conformational  changes  of  the  ssDNA 
substrate  (Fig.  4). 

The  on-rate  of  A3G,  determined  with  a  10-fold  excess  of  A3G 
to  DNA  in  the  presence  and  absence  of  MgCl2,  was  extremely 
rapid,  exhibiting  an  initial  fast  phase  and  slower  second  phase 
(Fig.  4,  A  and  B).  About  half  of  the  ssDNA  was  bound  within  the 
instrument  dead  time  (1.1  ms).  The  estimated  on-rates  under 
these  conditions  are  140/s  (Fig.  4A;  5  mM  MgCl2)  and  91/s  (Fig. 


4B;  without  MgCl2).  These  measurements  were  also  made  as  a 
function  of  A3G  concentration  (0.4 -0.8  piu)  to  calculate  the 
bimolecular  rate  constants,  which  approach  the  diffusion  limit, 
with  or  without  MgCl2  (5  mM  MgCl2, 1  X  10^/m/s;  0  mM  MgCl2, 
9  X  10^/m/s).  Each  on-rate  experiment,  in  the  presence  or 
absence  of  MgCl2, confirmed  that  the  A3G  on-rate  is  biphasic 
with  an  initial  rapid  phase  (—60%  amplitude)  and  secondary 
slow  phase  (—40%  amplitude).  It  may  be  that  the  two  distinct 
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FIGURE  4.  Stopped-flow  fluorescence  presteady  state  interactions  of  A3G  with  ssDNA.  A  and  B,  the  on-rate 
of  A3G  (1  jLtM)  to  a  69-nt  fluorescein-labeled  ssDNA  substrate  (1 00  nivi)  was  determined  by  monitoring  changes 
in  rotational  anisotropy  of  free  ssDNA  {gray  line)  following  the  addition  of  A3G  {black  line)  as  a  function  of  time. 
The  on-rate  was  fit  to  a  double  exponential  equation.  A,  for  A3G  in  buffer  with  5  mM  MgCl2  the  rapid  rate  of 
140  ±  14/s  (62%  amplitude)  was  followed  by  a  slower  rate  of  15  ±  2/s  (38%  amplitude).  B,  for  A3G  in  the 
absence  of  MgCl2,  the  rapid  rate  of  91  ±  10/s  (53%  amplitude)  was  followed  by  a  slower  rate  of  10  ±  1/s(47% 
amplitude).  C  and  D,  changes  in  fluorescence  of  1 00  nM  pyrrolo-C-labeled  ssDNA  {inset  graph)  upon  the  addi¬ 
tion  of  1  piM  A3G  {black  line)  were  monitored  as  a  function  of  time.  The  pyrrolo-C  label  was  placed  1 0  nt  from  the 
3'-end  of  the  69  nt  ssDNA.  The  fluorescence  change  was  fit  to  a  double  exponential  equation.  C,  under  condi¬ 
tions  with  5  mM  MgCl2  the  fluorescence  change  began  with  a  rapid  rate  of  63  ±  8/s  (40%  amplitude)  and  was 
followed  by  a  slower  rate  of  7  ±  0.3/s  (60%  amplitude).  D,  in  the  absence  of  MgCl2,  no  fluorescence  change 
was  detected.  E and  F,  the  off-rate  of  A3G  (0.2  pM)  from  a  69-nt  fluorescein-labeled  ssDNA  substrate  (50  nM)  was 
determined  by  preincubating  A3G  with  ssDNA  {gray  line)  and  monitoring  changes  in  rotational  anisotropy 
following  the  addition  of  1 00 -200-fold  excess  unlabeled  69-nt  ssDNA  as  trap  {black  line).  The  anisotropy  of  free 
fluorescein-labeled  ssDNA  mixed  with  unlabeled  ssDNA  trap  is  denoted  with  a  horizontal  line.  The  off-rate  was 
fit  to  a  double  exponential  decay.  E,  in  the  presence  of  5  mM  MgCl2,  the  reduction  in  the  rotational  anisotropy 
signal,  representing  A3G  dissociating  from  fluorescein-labeled  DNA  and  binding  the  unlabeled  trap  DNA,  had 
an  initial  rate  of  decay  fitting  to  0.084  ±  0.005/s  (20%  amplitude)  and  a  second  decay  rate  of  0.00279  ± 
0.00006/s  (80%  amplitude).  F,  in  the  absence  of  IVlgCl2,  the  reduction  in  the  rotational  anisotropy  signal  had  an 
initial  rate  of  decay  fitting  to  0.01 1  ±  0.001 /s  (25%  amplitude)  and  a  second  decay  rate  of  0.0010  ±  0.0002/s 
(75%  amplitude). 


on-rates  reflect  different  interactions  of  a  single  A3G  form  with 
DNA  or  that  multiple  oligomeric  forms  of  A3G  exist  and  inter¬ 
act  differently  with  DNA. 

It  is  likely  that  when  an  A3G  molecule,  which  has  two  DNA 
binding  domains  per  monomer,  binds  to  ssDNA,  there  will  be 
multiple  binding  interactions  along  the  substrate,  which  may 
cause  a  conformational  change  in  the  structure  of  the  ssDNA. 
To  detect  conformational  changes  in  the  ssDNA,  such  as  base 
unstacking,  we  measured  the  presteady  state  change  in  fluores¬ 
cence  intensity  of  a  2'-deoxycytidine  analogue,  pyrrolo-dC, 
which  was  located  10  nt  from  the  3 '-end  of  a  69-nt  ssDNA  (Fig. 
4,  C  and  D),  Notably,  biphasic  changes  in  fluorescence  were 


observed  with  5  mM  MgCl2  (Fig. 
4C),  having  a  fast  rate  of  63/s  (40% 
amplitude)  and  slower  rate  of  7/s 
(60%  amplitude).  The  data  at  5  mM 
MgCl2  suggest  that  there  was  a  con¬ 
formational  change  in  the  ssDNA 
occurring  at  a  considerably  slower 
rate  (63/s;  Fig.  4C)  than  A3G- 
ssDNA  binding  (140/s;  Fig.  4A).  In 
contrast,  there  was  almost  no 
detectable  fluorescence  change  in 
the  absence  of  MgCl2  of  a  pyrrolo-C 
residue  located  10  nt  from  the 
3 '-end  (Fig.  4D). 

An  estimate  of  the  spatial  proper¬ 
ties  of  the  conformational  change 
can  be  made  by  measuring  the 
change  in  steady  state  fluorescence 
intensity  for  pyrrolo-C  residues 
placed  at  different  locations  along 
the  ssDNA  (supplemental  Fig.  S5A). 
There  was  a  significant  increase  in 
fluorescence  at  all  locations  along 
the  DNA  in  the  presence  of  MgCl2, 
with  maxima  occurring  at  10  nt 
from  the  5'-  and  3 '-ends,  corre¬ 
sponding  to  the  locations  of  the  flu- 
orophores  in  the  presteady  state 
analysis  (Fig.  4C  and  supplemental 
Fig.  S5,  A  and  B).  In  the  absence  of 
MgCl2,  there  was  essentially  no 
change  in  fluorescence  intensity 
along  the  DNA  except  when  the 
pyrrolo-C  was  located  10  nt  in  from 
the  5 '-end  (supplemental  Fig.  S5,  A 
and  C).  The  presteady  state  and 
steady  state  fluorescence  intensity 
measurements  are  mutually  sup¬ 
portive  and  suggest  that  a  confor¬ 
mational  change  in  the  DNA,  per¬ 
haps  indicative  of  DNA  wrapping, 
occurs  most  strongly  with  5  mM 
MgCl2  (supplemental  Fig.  S5A)  at 
—  15  ms  after  A3G  binds  to  DNA 
(Fig.  4,  A  and  C). 

A3G  must  remain  bound  to  the 
same  ssDNA  substrate  long  enough  for  it  to  act  processively 
(see  Fig.  3,  A  and  ^).  To  measure  the  residence  time  of  A3G 
bound  to  a  single  DNA  substrate  molecule  (Fig.  4,  E  and  F),  A3G 
was  prebound  to  fluorescein-labeled  ssDNA,  and  a  large  excess 
(100 -200-fold)  of  the  identical  unlabeled  ssDNA  was  added  as 
a  trap  to  bind  A3G  once  it  dissociated,  observed  as  a  decrease  in 
anisotropy  from  the  fully  bound  starting  point.  We  verified  that 
the  off-rates  were  independent  of  the  trap  concentration,  sug¬ 
gesting  that  the  unlabeled  DNA  acts  as  a  passive  trap,  not  as  a 
competitor  for  A3G  binding. 

The  off-rate  of  A3G  was  biphasic  with  a  rapid  initial  off-rate 
occurring  during  the  first  few  seconds  (—20%  amplitude),  fol- 
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FIGURE  5.  A3G  and  AID  deamination  of  oligo(A)  and  oligo(T)  substrates  containing  two  target  motifs. 

A3G  or  AID  was  incubated  with  a  70-nt  internally  fluorescein-labeled  ssDNA  substrate  with  two  CCC  or  AGC 
motifs  1 3  nt  apart  and  at  an  enzyme/DNA  ratio  of  1 :20.  Single  deaminations  of  the  5'  C  and  3'  C  are  detected  as 
the  appearance  of  labeled  52-  and  33-nt  fragments,  respectively;  double  deamination  of  both  Cs  on  the  same 
molecule  results  in  a  15-nt  labeled  fragment  (5'  C  and  3'  C).  A,  A3G  deamination  bias  is  affected  by  the 
sequence  composition.  The  ratio  of  5'  C/3'  C  deamination  is  shown  as  a  function  of  MgCl2  concentration  for 
each  variation  of  the  70-nt  internally  fluorescein-labeled  ssDNA  substrate  tested.  Error  bars  show  ±  1  S.E.  from 
three  independent  experiments.  Shown  is  A3G  deamination  on  an  oligo(A)  sequence  (AisCCCAgTi  A6CCCA36) 
{B),  an  oligo(T)  sequence  (T15CCCT13CCCT36)  (O,  and  an  ssDNA  (D).  E,  AID  deamination  on  an  oligo(A)  sequence 
(At5GCA7TiA6GCA36).  Substrate  usages  (%)  and  processivity  factors  are  shown  below  the  gels. 


lowed  by  a  much  slower  off-rate  occurring  on  a  time  scale  of 
minutes  (—80%  amplitude)  (Fig.  4,  E  and  F).  The  residence  time 
(reciprocal  of  the  off-rate)  for  the  A3G  rapidly  dissociating 
small  amplitude  phase  was  — 12  s  (0.084/s)  with  salt  (Fig.  4F') 
and  about  8-fold  longer  (—90  s)  without  salt  (0.011/s;  Fig.  4F). 
The  second  much  more  stable  binding  mode  was  characterized 
by  residence  times  of  about  300  s  and  1000  s,  in  the  presence 
and  absence  of  MgCl2,  respectively  (Fig.  4,  E  and  F).  It  is  likely 
that  the  two  distinct  off-rates  reflect  the  stability  of  different 
A3G-DNA  complexes.  The  key  point,  however,  is  that  A3G  can 
remain  bound  to  the  DNA  on  the  order  of  minutes,  which  pro¬ 
vides  ample  time  to  catalyze  the  correlated  deaminations  that 
take  place  within  5  s  of  a  single  A3G-DNA  encounter  (Fig.  3,  A 
and  B). 

A3G  Deamination  Polarity  Dependence  on  DNA  Topology — 
Target  CCC  motifs  were  embedded  in  homopolymer  A  and  T 


regions  to  investigate  the  effect  of 
DNA  topology  on  A3G  deamination 
bias  (Fig.  5).  Since  A3G  can  cause  a 
conformational  change  in  DNA  in 
the  presence  of  MgCl2  (Fig.  4C)  we 
expected  that  A3G  would  be  sensi¬ 
tive  to  DNA  topology.  Oligo(A) 
ssDNA  substrates  are  known  to 
have  more  rigid  conformations  than 
oligo(T)  or  ssDNA  (36).  When  act¬ 
ing  on  an  oligo(A)  substrate,  (A^^gC- 
CCAgTj^AgCCCAgg,  where  T  con¬ 
tains  fluorescein  label),  A3G 
deaminated  the  5'  C  with  the  same 
5'  C/3'  C  deamination  ratio  (—4) 
regardless  of  MgCl2  concentration 
(Fig.  5,  A  and  B).  This  result  is  in 
contrast  to  ssDNA,  where  A3G 
showed  no  deamination  bias  in  the 
absence  of  MgCl2  (Fig.  5,  A  and  D; 
see  Fig.  1C).  When  acting  on  the 
more  flexible  oligo(T)  substrate, 
(T^gCCCT^gCCCTgg),  A3G  exhib¬ 
ited  a  deamination  bias  in  the  pres¬ 
ence  or  absence  of  MgCl2,  but  in 
contrast  to  oligo(A)  and  ssDNA, 
there  was  a  marked  increase  in  the 
deamination  bias  with  increasing 
MgCl2,  going  from  a  5'  C/3'  C 
deamination  ratio  of  2.8  (no  salt)  to 
10  (10  mM  MgCl2)  (Fig.  5,  A  and  C). 
In  contrast,  the  APOBEC  family 
member,  AID,  deaminated  ssDNA 
in  an  unbiased  manner  in  the  pres¬ 
ence  and  absence  of  MgCl2  on 
ssDNA  (25)  or  on  the  oligo(A)  sub¬ 
strate  {A^^GCAyT ^A^GCA^g,  Fig.  5, 
A  and  F). 

The  Oligomerization  State  ofASG 
Depends  on  MgCl2  and  DNA — The 
presteady  state  kinetic  data  revealed 
biphasic  interactions  of  A3G  with 
DNA  (Fig.  4)  and  for  some  conditions  biphasic  deamination 
rates  (Fig.  3,  C  and  D).  This  could  mean  that  a  homogenous 
population  of  A3G  dimers  (26)  interacts  with  the  DNA  in  two 
modes  or  that  different  oligomeric  states  of  A3G  are  present 
and  exhibit  different  kinetic  rates. 

AFM,  which  yields  topographic  images  of  molecules  depos¬ 
ited  on  a  surface,  can  be  used  effectively  for  examining  the  oli¬ 
gomerization  states  of  proteins  (33,  34,  37-42).  Specifically,  it 
has  been  demonstrated  that  the  volume  of  proteins  (V)  meas¬ 
ured  by  AFM  exhibits  a  linear  dependence  on  molecular  weight 
[MW)  with  1/  =  1.2  X  MW  -  14.7  (33,  34).  Accordingly,  we 
used  AFM  to  characterize  the  oligomeric  state  of  A3G  (45.6 
kDa)  on  ssDNA,  oligo(A),  and  oligo(T)  substrates,  in  the  pres¬ 
ence  and  absence  of  MgCl2  (Fig.  6). 

A3G  alone  [Le.  in  the  absence  of  ssDNA)  in  the  presence  of 
salt  exists  as  a  mix  of  monomers  (—40%)  and  a  considerably 
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FIGURE  6.  l\/lgCl2  and  DNA  binding  modulate  the  oligomerization  state  of 
A3G.  AFM  was  used  to  determine  the  oligomeric  state  of  A3G  alone  in  the 
presence  of  5  mM  MgCl2  (A),  alone  in  the  absence  of  MgCl2  {B),  in  the  presence 
of  ssDNA  and  5  mM  MgCl2  (O,  and  in  the  presence  of  ssDNA  and  absence  of 
MgCl2  (D).The  oligomeric  state  of  A3G  was  also  determined  in  the  presence  of 
oligo(A)  and  5  mM  MgCl2  {E),  in  the  presence  of  oligo(A)  and  absence  of  MgCl2 
(F),  in  the  presence  of  oligo(T)  and  5  mM  IVlgCl2  (G),  and  in  the  presence  of 
oligo(T)  and  absence  of  MgCl2  (H).  The  70-nt  ssDNA  was  added  to  200  nM  final 
concentration.  A3G  was  added  to  1 00  nM  final  concentration.  Volume  distri¬ 
butions  of  A3G  are  plotted  against  the  percentage  of  total  proteins.  Total 
proteinscounted  were  as  follows:  1354  (A),  3168  (fi),  4231  (0,663  (D),  1563  (E), 
1 224  (F),  6738  (G),  and  706  (F/).  Representative  AFM  images  of  A3G  are  shown 
for  each  condition.  Images  are  300  X  300  nm  with  a  height  scale  of  5  nm. 


TABLE  1 

Distribution  of  A3G  oligomerization  states  in  response  to  MgCl2  and 
DNA 


Data  are  summarized  from  AFM  experiments  shown  in  Fig.  6. 


AFM  conditions 

Distribution  of  oligomerization  states 

Monomers 
(0-70  nmO 

Dimers 
(70-120  nm^) 

Higher  order 
oligomers 
(>120  nm^) 

% 

% 

% 

A3G  alone,  5  mM  MgCl2 

40 

14 

46 

A3G  alone,  0  mM  Mg  CI2 

23 

19 

58 

A3G  -1-  ssDNA,  5  mM  Mg  CI2 

23 

15 

62 

A3G  -1-  ssDNA,  0  mM  Mg  CI2 

51 

26 

23 

A3G  -1-  oligo(A),  5  mM  Mg  CI2 

26 

11 

63 

A3G  -1-  oligo(A),  0  mM  Mg  CI2 

24 

27 

49 

A3G  -1-  oligo(T),  5  mM  Mg  CI2 

21 

7 

71 

A3G  -1-  oligo(T),  0  mM  Mg  CI2 

43 

28 

29 

smaller  population  of  dimers  ('-14%),  along  with  higher  order 
oligomers  (—46%)  (Fig.  6A  and  Table  1).  A3G  alone  in  the 
absence  of  salt  behaved  differently,  showing  about  a  2-fold 
reduction  in  monomers  and  an  increase  in  the  combined  frac¬ 
tions  of  dimers  and  higher  order  oligomers  (Fig.  6B  and  Table 
1).  These  AFM  data,  which  visualize  A3G  in  the  absence  of 
DNA,  are  consistent  with  gel  filtration  data  showing  that  A3G 
exists  in  multiple  forms  (7, 15).  A3G  dimers  were  observed  with 
small  angle  x-ray  scattering  in  the  presence  of  NaCl  and  MgCl2, 
and  we  speculate  that  the  absence  of  higher  order  oligomers 
may  reflect  differences  in  buffer  conditions  (26). 

A3G  in  the  presence  of  salt  and  ssDNA  resulted  in  about  a 
2-fold  reduction  in  monomers  (—23%)  and  an  increase  in 
higher  order  oligomers  (62%),  relative  to  the  absence  of  DNA 
(Fig.  6,  compared  with  C,  and  Table  1).  These  data  (Fig.  6C)  are 
consistent  with  equilibrium  binding  data  for  A3G  association 
with  ssDNA,  which  show  a  slight  cooperativity  in  A3G  binding 
to  DNA  (15)  (supplemental  Fig.  S4).  The  polydispersed  nature 
of  A3G  on  DNA  (Fig.  6C)  is  also  in  agreement  with  previous 
data  showing  multiple  bands  in  gel  shift  analyses  (15,  31).  In 
contrast,  in  the  absence  of  MgCl2,  the  addition  of  ssDNA 
caused  a  shift  in  the  A3G  distribution,  resulting  in  2-fold  more 
monomers  (51%)  than  dimers  (26%),  whereas  in  the  absence  of 
salt  and  DNA,  the  monomer/dimer  ratio  was  —  1  (Fig.  6,  B  and 
A  and  Table  1).  The  2-fold  increase  in  monomer/dimer  ratio 
was  accompanied  by  a  concomitant  2-fold  reduction  in  higher 
order  oligomers.  These  data  indicate  that  in  the  presence  of  salt, 
binding  to  ssDNA  influences  A3G  to  associate  into  higher  order 
oligomers,  whereas,  in  the  absence  of  salt,  binding  of  ssDNA 
causes  the  A3G  to  dissociate  from  higher  order  oligomers  to 
predominantly  monomers  and  dimers. 

It  appears  that  different  oligomeric  states  may  be  responsible 
for  the  biphasic  ssDNA  binding  and  deamination  kinetics. 
Taken  together  with  Fig.  5D,  the  AFM  data  (Fig.  6,  C  and  D) 
suggest  that  the  different  oligomeric  states  present  when  A3G 
binds  ssDNA  under  different  salt  conditions  influence  whether 
A3G  acts  in  either  a  directional  or  random  manner,  outside  the 
30-nt  dead  zone  (Fig.  1,  A  and  B).  We  confirmed  that  this  correla¬ 
tion  is  also  found  when  the  AFM  conditions,  A3G/DNA  ratio  of 
1:2,  are  used  for  the  deamination  reaction  (supplemental  Fig.  S6). 
We  further  speculate  that  oligomers  promote  directional  deami¬ 
nation  and  monomers  promote  random  deamination  (compare 
Fig.  6,  C  and  D,  with  Fig.  5D;  see  Table  1). 
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The  observation  that  the  oligo(A)  and  oligo(T)  ssDNA  sub¬ 
strates  promote  the  deamination  polarity  of  A3G  without  the 
addition  of  MgCl2  (Fig.  5A)  suggests  that  binding  to  these  DNA 
substrates  may  influence  the  oligomeric  state  of  A3G.  To  test 
this  idea,  we  investigated  the  oligomeric  states  of  A3G  upon 
binding  to  both  oligo(A)  and  oligo(T)  ssDNA  substrates  in  the 
presence  and  absence  of  MgCl2.  The  distributions  of  A3G  with 
oligo(A)  were  similar  in  the  presence  and  absence  of  MgCl2 
(Fig.  6,  E  and  F,  respectively).  The  monomeric  species  were 
similar,  as  were  the  sum  of  dimeric  and  higher  order  species 
(Table  1).  The  data  are  consistent  with  A3G  exhibiting  compa¬ 
rable  deamination  polarities  on  oligo(A)  in  both  the  presence 
and  absence  of  salt  (Fig.  SB  and  supplemental  Fig.  S6).  A  com¬ 
parison  of  the  data  in  the  absence  of  MgCl2  for  no  DNA  (Fig. 
6F),  oligo(A)  (Fig.  6F),  and  ssDNA  (Fig.  6D)  reveals  that  ssDNA, 
but  not  oligo(A),  promotes  the  dissociation  of  A3G  to  a  pre¬ 
dominantly  monomer  population  (Table  1)  (2-fold  fewer 
monomers  with  oligo(A)  than  ssDNA)  in  accord  with  the  lack 
of  polarity  for  A3G  deamination  of  ssDNA  in  the  absence  of 
MgCl2  (Fig.  SD).  In  contrast  to  the  oligo(A)  data,  the  distribu¬ 
tions  of  A3G  with  oligo(T)  in  the  presence  and  absence  of 
MgCl2  are  different  (Figs.  6,  G  and  //,  respectively),  with  a  far 
higher  proportion  of  molecules  being  present  in  higher  order 
oligomeric  states  in  the  presence  of  salt  (71%)  compared  with 
the  absence  of  salt  (29%)  (Table  1).  These  relative  distributions 
agree  with  the  large  increase  in  deamination  polarity  for  A3G  on 
oligo(T)  with  increasing  MgCl2  concentration  (Fig.  5A,  supple¬ 
mental  Fig.  S6).  In  the  absence  of  MgCl2,  the  distribution  for  A3G 
binding  to  oligo(T)  DNA  is  similar  to  that  for  A3G  binding  to 
ssDNA,  except  that  the  monomer  peak  is  reduced  for  oligo(T) 
relative  to  ssDNA  (43%  versus  51%,  Table  1;  Fig.  6,  compare //and 
D),  This  shift  to  higher  order  oligomeric  states  is  consistent  with 
A3G  showing  polarity  on  oligo(T)  in  the  absence  of  MgCl2;  the 
ratio  of  5'  C/3'  C  deamination  is  —2.8  (Fig.  SA), 

DISCUSSION 

We  have  previously  shown  that  A3G  binds  randomly  to 
ssDNA  and  is  able  to  deaminate  C  residues  processively  with  a 
3'  — >  5'  bias  (15).  In  this  paper,  we  have  investigated  the  mech¬ 
anisms  of  processivity  and  favored  deamination  in  the 
5 '-ssDNA  direction  using  a  combination  of  kinetics  and  AFM 
analysis.  Both  attributes  of  A3G,  its  polarity  in  the  presence  of 
salt  and  its  processivity  in  the  presence  and  absence  of  salt, 
occur  at  the  shortest  times  (—5  s)  for  which  deamination  can  be 
detected  (Fig.  3,  A  and  B). 

Aside  from  the  importance  of  understanding  the  mecha¬ 
nisms  underlying  A3G  processivity  and  especially  polarity, 
which  occur  in  the  absence  of  an  external  energy  source  such  as 
ATP  or  GTP,  the  intrinsic  biochemical  properties  of  A3G  are 
important  biologically.  The  time  scales  of  A3G-DNA  com¬ 
plexes,  which  rapidly  form  and  remain  for  5-15  min  (Fig.  4,  A, 
B,  Ey  and  F)  suggest  that  in  vivo  A3G  has  the  potential  to  cata¬ 
lyze  sizable  numbers  of  processive  deaminations  on  (  — )-cDNA 
within  the  limited  time  that  it  is  single-stranded.  Notably,  in 
vivo  data  reveal  twin  5'  ^  3'  G  to  A  mutational  gradients  of  the 
HIV-1  genome,  which  means  that  deaminations  are  increasing 
toward  the  5 '-end  of  the  (— )-cDNA  in  two  regions  marked  by 
reverse  transcription  primer  sites  on  the  (  — )-cDNA  (17, 18).  It 


has  been  suggested  that  the  mutational  gradients  are  attributa¬ 
ble  to  the  temporal  effects  of  reverse  transcription  acting  in 
combination  with  RNase  H,  so  that  regions  that  remain  single- 
stranded  over  the  longest  time  frame  are  most  susceptible  to 
A3G-catalyzed  deaminations  (17, 18).  A  recent  analysis  of  edited 
HIV-1  genomes,  however,  suggests  that  the  intrinsic  3'  — >  5' 
deamination  bias  of  A3G  (15)  makes  a  substantial  contribution  to 
local  regions  of  HIV- 1  mutational  polarity  in  vivo  (17). 

Our  AFM  results  demonstrate  that  interactions  of  A3G  with 
ssDNA  and  salt  can  determine  A3G  oligomeric  forms.  Binding 
of  A3G  to  ssDNA  at  high  salt  promotes  the  formation  of  higher 
order  oligomers,  whereas  binding  at  low  salt  favors  the  pres¬ 
ence  of  monomers  (Table  1).  Monomeric  forms  of  A3G 
observed  by  AFM  in  the  presence  of  ssDNA  and  absence  of  salt 
(Fig.  6D  and  Table  1)  correlate  with  random  deamination  (Fig. 
5D),  whereas  oligomeric  forms  that  occur  in  the  presence  of 
ssDNA  and  salt  (Fig.  6C  and  Table  1)  correlate  with  a  3'  — >  5' 
bias  (Fig.  SD),  However,  the  polydispersed  nature  of  the  larger 
oligomers  of  A3G,  at  5  mM  MgCl2  (Fig.  6C  and  Table  1),  pre¬ 
cludes  the  identification  of  a  single  oligomeric  state  responsible 
for  the  deamination  polarity.  Nevertheless,  the  role  of  dimers, 
tetramers,  and  higher  oligomeric  forms  of  A3G  in  governing 
the  deamination  bias  is  strongly  supported  by  deaminations  of 
CCC  motifs  embedded  within  oligo(A)  or  oligo(T)  sequences. 
A3G  deaminates  with  a  3'  — >  5'  deamination  bias  even  in  the 
absence  of  salt  on  the  oligo(A)  and  oligo(T)  (Fig.  5,  B  and  C)  but 
not  on  ssDNA  substrates  (Fig.  5D),  and  the  higher  oligomeric 
states  of  A3G  are  more  stable  on  oligo(A)  and  oligo(T)  than  on 
the  ssDNA  in  the  absence  of  MgCl2  (Fig.  6,  compare  E  and  H 
with  D;  see  Table  1).  It  is  conceivable  that  larger  oligomeric 
states  form  concatamers  of  the  dimeric  form  (26)  and  probably 
involve  C-terminal  as  well  as  N-terminal  interactions,  in  accord 
with  the  APOBEC2  crystal  structure  (27),  but  we  cannot  resolve 
such  oligomeric  interactions  with  AFM  data. 

The  single  turnover  deamination  rates,  determined  with 
A3G  in  a  10-fold  molar  excess  over  ssDNA,  show  that  the 
MgCl2  concentration  dictates  the  intrinsic  catalytic  rates  of  C 
targets  at  different  locations  along  an  ssDNA  molecule.  Virtu¬ 
ally  identical  intrinsic  deamination  rates  exist  in  the  absence  of 
salt  for  target  motifs  located  outside  of  the  deamination  dead 
zone  at  57  and  36  nt  from  the  3 '-end  (Fig.  3D),  In  the  presence 
of  salt,  however,  the  single  turnover  deamination  rates  differ 
substantially  for  target  motifs  located  57  and  36  nt  from  the 
3 '-end  (Fig.  3C,  inset).  Comparing  the  deamination  rates  with 
the  slowest  off-rate  measured  in  the  presence  of  salt  (0.003/s; 
Fig.  4£'),  a  C  target  57  nt  from  the  3 '-end  of  ssDNA,  having  a 
deamination  rate  of  0.25/s  (Fig.  3C)  can  easily  be  deaminated 
before  A3G  falls  off  the  DNA.  However,  this  is  not  generally 
true  for  the  C  target  36  nt  from  the  3 '-end  of  an  ssDNA,  which 
is  characterized  by  biphasic  deamination  kinetics  (Fig.  3C). 
Roughly  25%  of  the  substrates  are  deaminated  at  0.12/s, 
whereas  75%  are  deaminated  at  0.004/s  (Fig.  3C),  the  latter  rate 
being  about  the  same  as  the  slowest  off-rate  (Fig.  4£').  Thus,  in 
the  presence  of  salt,  there  is  a  reduction  in  intrinsic  rates  of 
deamination  occurring  in  targets  nearer  the  3 '-end  that  seems 
to  account  for  the  3'  — >  5'  polarity. 

In  the  presence  of  salt,  the  slower  catalytic  rate  for  C-targets 
closer  to  the  3 '-ssDNA  end  (Fig.  3C)  may  stem,  in  part,  from 


MAY  16,  2008-VOLUME  283-NUMBER  20 


JOURNAL  OF  BIOLOGICAL  CHEMISTRY  1 3789 


Downloaded  from  www.jbc.org  at  USC  Norris  Medical  Library  on  September  14,  2008 


The  Journal  of  Biological  Chemistry 


Biochemical  Basis  ofASG  3'  -^5'  Deamination  Polarity 


Directional 


FIGURE  7.  Model  for  oligomeric  regulation  of  A3G  directional  and  ran¬ 
dom  deamination.  A,  directional  3'  ^  5'  deannination  in  the  presence  of 
"large"  A3G  oligomers.  Larger  oligomers,  shown  here  as  a  tetramer,  although 
dimers  may  also  be  present,  are  stable  when  binding  ssDNA  in  the  presence 
of  Mg^^  (Fig.6C).  In  the  presence  of  Mg^"^,  there  is  a  reduction  in  the  intrinsic 
rates  of  C-deamination  toward  the  3'-DNA  end,  which  accounts  for  the  3'  ^ 
5'  polarity  (Fig.  3,  C  and  D).  The  catalytically  inactive  N-terminal  domains  are 
depicted  asyellowdashed circles.  Jhe  hot  active  site  {red circle)  is  aligned  in  the 
catalytically  favored  3'  ^  5'  orientation  and  closest  to  the  preferential  C  for 
deamination  in  a  5  '-CCC  target  motif,  whereas  the  cold  active  site  {blue  circle) 
is  aligned  in  the  opposite  orientation  nearest  the  barely  deaminated  C  in  a 
5'-CCC  target  motif.  Active  sites  located  internally  {cross-hatched  circles)  can¬ 
not  effectively  access  CCC  motifs  and  are  essentially  noncatalytic.  The  com¬ 
bination  of  inaccessible  CCC  motifs  and  the  cold  active  site  {blue  circle) 
located  toward  the  3'-DNA  end  results  in  an  inhibition  of  deamination  rela¬ 
tive  to  the  easily  accessible  CCC  motifs  toward  the  5'-DNA  end.  CCC  motifs 
located  within  the  ~32-nt  dead  zone  at  the  3'-DNA  end  are  deaminated  with 
at  least  a  1 0-fold  lower  activity  compared  with  motifs  outside  the  dead  zone 
located  near  the  5'-DNA  end  (Fig.  ^A).  We  envision  that  although  the  A3G 
oligomer  can  slide  and  jump  bidirectionally  along  the  ssDNA,  multiple  A3G- 
DNA  binding  interactions  may  wrapthe  DNAaround  A3G  (Fig. 40  and  restrict 
free  diffusion  of  the  oligomer,  thus  reducing  the  possibility  for  resampling 
nondeaminated  C  targets.  B,  random  deamination  performed  by  a  monomer. 
Larger  oligomers  are  disrupted  upon  binding  ssDNA  in  the  absence  of  Mg^^ 
(Fig.  6D).  The  A3G  monomers  can  diffuse  freely  along  the  ssDNA  and  thereby 
deaminate  each  CCC  motif  with  roughly  similar  efficiencies  outside  the  dead 
zone  at  the  3'-DNA  end.  The  monomer  retains  partial  catalytic  asymmetry  by 
deaminating  with  at  least  a  15-fold  lower  activity  on  CCC  motifs  within  the 
~30-nt  dead  zone  at  the  3'-DNA  end  compared  with  motifs  outside  the  dead 
zone  (Fig.  ^B).  The  dead  zone  at  the  3'-DNA  end  appears  to  provide  an  opti¬ 
mal  binding  region,  allowing  A3G  to  align  in  a  catalytically  favored  3'  ^  5' 
orientation  to  access  the  preferred  C  of  a  5'-CCC  motif. 

reduced  access  of  the  CCC  motifs  to  enzyme  active  sites.  The 
reduced  catalytic  rate  toward  the  S'-DNA  region  occurs  in  con¬ 
junction  with  the  formation  of  larger  oligomers  in  the  presence 
of  salt  (Fig.  6C  and  Table  1).  We  speculate  that  active  sites 
situated  within  an  oligomer,  such  as  a  tetramer,  are  unable  to 
access  and  deaminate  C  targets  efficiently.  However,  we 
hypothesize  that  A3G  oligomers,  including  dimers,  are  direc¬ 
tionally  biased.  If  multiple  binding  interactions  were  to  wrap 
the  DNA  around  A3G  (Fig.  4C)  and  restrict  free  diffusion  of  the 
oligomer,  then  a  second  level  of  bias  is  added.  The  intact  oli¬ 
gomer  would  still  be  able  to  diffuse  bidirectionally,  encompass¬ 
ing  sliding  and  jumping  motions,  but  three-dimensional  diffu¬ 
sion  would  become  increasingly  constricted  as  the  oligomer 
size  increased,  limiting  sampling  of  the  ssDNA  to  find  all  pos¬ 
sible  C  targets.  We  have  incorporated  these  ideas  in  a  schematic 
model  (Fig.  7 A).  The  elongated  shape  of  the  A3G  protein  con¬ 
tained  in  the  sketch  is  based  on  small  angle  x-ray  scattering  of 
A3G  in  aqueous  solution  (26). 

We  further  suggest  that  there  is  a  polarity  of  the  C-terminal 
active  sites  at  each  end  of  the  A3G  oligomer  so  that  they  do  not 
deaminate  with  equal  rates  (Fig.  7A,  red  circle  (high  rate)  and 
blue  circle  (low  rate)).  There  are  three  observations,  in  addition 
to  previous  data  (15),  that  support  an  asymmetric  catalytic  ori¬ 
entation  of  A3G  so  that  active  sites  facing  the  5' -DNA  end  (Fig. 


7A,  red  circle)  catalyze  C  — >  U  deaminations  much  more  effi¬ 
ciently  than  those  facing  the  3' -DNA  end  (Fig.  7A,  blue  circle). 
First,  there  is  a  30-nt  region  that  is  deaminated  with  extremely 
low  efficiency  (deamination  dead  zone)  located  at  the  3 '-end  of 
linear  ssDNA  (Fig.  1,  A  and  B).  The  30-nt  dead  zone  may  pro¬ 
vide  an  optimal  space  for  A3G  to  encounter  a  CCC  motif  in  an 
orientation  allowing  catalysis.  Second,  directionally  biased 
deamination  occurs  on  a  partially  dsDNA  circle  in  the  presence 
but  not  absence  of  salt,  whereas  directional  deamination  does 
not  occur  on  an  ssDNA  circle  either  with  or  without  salt  (Fig.  2), 
implying  that  A3G  needs  to  approach  a  CCC  motif  with  a  spe¬ 
cific  orientation.  Third,  there  is  a  biased  orientation  in  each 
5' -CCC  motif,  where  A3G  prefers  to  deaminate  the  3'-C  resi¬ 
due  (5' -CCC)  much  more  than  the  5'-C  residue  (5' -CCC)  even 
under  single  turnover  conditions  (Fig.  3£).  Therefore,  we  have 
depicted  the  “hot”  active  site  facing  the  5' -DNA  end  and 
aligned  closest  to  the  favored  C  for  deamination  in  a  5' -CCC 
motif  (Fig.  7A).  Conversely,  the  “cold”  active  site  is  facing  the 
3' -DNA  end  and  aligned  closest  to  the  barely  deaminated  C  in  a 
5' -CCC  motif  (Fig.  7A).  Although  A3G  has  not  been  previously 
modeled  with  C-terminal  domains  external  to  an  oligomer  (26), 
there  is  evidence  that  a  “monomeric”  C-terminal  deaminase 
domain  of  A3G  is  catalytically  active  (29,  30),  which  suggests 
that  trans  subunit  complementation  common  to  the  cytidine 
deaminase  family  (43)  is  not  required.  We  favor  this  model, 
because  it  seems  to  be  the  most  probable  asymmetric  arrange¬ 
ment  that  can  account  for  all  aspects  encompassing  the  bio¬ 
chemical  characteristics  of  A3G  3'  — >  5'  deamination  polarity 
(Fig.  7A). 

Random  deaminations  take  place  in  the  absence  of  salt.  Con¬ 
comitantly,  A3G  oligomers  that  form  in  the  absence  of  salt  (Fig. 
6B)  favor  the  formation  of  monomers  in  the  presence  of  ssDNA 
(Fig.  6D  and  Table  1).  A3G  is  illustrated  as  a  monomer  in  the 
absence  of  salt  (Fig.  7B),  although  A3G  dimers  may  also  form 
(Fig.  6D  and  Table  1).  It  is  likely  that  the  A3G  monomers  are 
free  to  diffuse  randomly  on  the  ssDNA  substrate  and  deaminate 
CCC  motifs  essentially  irrespective  of  target  location,  since 
there  is  not  a  strong  conformational  change  indicative  of  DNA 
wrapping,  in  the  absence  of  MgCl2  (Fig.  4D),  which  would  oth¬ 
erwise  limit  resampling  of  undeaminated  C  targets  (Fig.  7A). 

For  our  study,  the  high  salt  state  has  provided  mechanistic 
insights  into  the  determinants  for  3'  — >  5'  deamination  polarity. 
In  vivo,  there  are  likely  to  be  multiple  forms  of  A3G  in  higher 
and  lower  order  oligomeric  states.  There  is  evidence  that  A3G 
exists  in  high  molecular  mass  or  intravirion  A3G  complex  and 
low  molecular  mass  states  when  associated  with  and  when  dis¬ 
sociated  from  RNA,  respectively  (7,  16,  26).  Although  A3G  is 
incorporated  into  virions  as  an  intravirion  A3G  complex, 
unable  to  deaminate  (  — )-cDNA,  RNase  H  releases  the  inhibi¬ 
tory  HIV  RNA  during  reverse  transcription  to  form  the  active 
low  molecular  mass  form,  characterized  as  monomers  and/or 
dimers  by  gel  filtration  (16).  However,  the  form  that  the  low 
molecular  mass  takes  when  binding  DNA  has  not  been  func¬ 
tionally  characterized  apart  from  our  study,  and  it  is  likely  that 
in  vivo,  the  dimeric  and  larger  oligomeric  states  formed  with 
A3G,  DNA,  and  salt  (Fig.  6C)  are  relevant. 

In  vivo,  the  intrinsic  polarity  of  A3G  deamination  makes  a 
substantial  contribution  to  local  regions  of  HIV-1  mutational 
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polarity  (17),  which  suggests  that  higher  order  oligomers  of 
A3G  are  acting  on  ( —  )-cDNA.  Apart  from  deamination  activity, 
A3G  may  also  have  the  capacity  for  blocking  reverse  transcrip¬ 
tion.  A  recent  in  vitro  study  has  shown  that  A3G  can  block 
reverse  transcriptase  DNA  elongation  by  competitively  binding 
RNA  or  ssDNA  segments  of  the  HIV-1  genome  or  (  — )-cDNA 
with  a  higher  affinity  than  reverse  transcriptase  (8),  a  finding 
supported  by  A3G  inhibition  of  HIV-1  in  resting  CD4+  T  cells 
(7).  Our  data  suggest  that  a  potentially  key  aspect  of  the  mech¬ 
anism  inhibiting  reverse  transcriptase  would  involve  the  forma¬ 
tion  of  higher  oligomeric  states  of  A3G.  Higher  oligomeric 
forms  of  A3G  can  presumably  interact  with  many  regions  on 
the  DNA,  which  is  likely  to  inhibit  access  by  other  enzymes, 
including  reverse  transcriptase.  Here  the  assembly  of  variably 
sized  oligomeric  forms  offers  two  distinct  ways  that  A3G  can 
inhibit  viral  propagation  when  binding  to  (  — )-cDNA,  a  cata¬ 
lytic  mode  creating  biased  “hot  spots”  for  deamination  (17)  to 
ensure  mutational  inactivation  of  HIV  and  a  noncatalytic  mode 
involving  higher  ordered  structures,  which  can  block  the  action 
of  reverse  transcriptase  (8). 
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